Overview


Dataset statistics

Number of variables: 39
Number of observations: 404566
Missing cells: 13965606
Missing cells (%): 88.5%
Duplicate rows: 16852
Duplicate rows (%): 4.2%
Total size in memory: 120.4 MiB
Average record size in memory: 312.0 B
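
The percentages and the average record size in this block follow from the raw counts. A minimal Python sketch of that arithmetic, assuming the missing-cell percentage is taken over rows × columns and that 1 MiB is 1,048,576 bytes; the variable names are illustrative and not part of the report:

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 39
n_observations = 404_566
missing_cells = 13_965_606
duplicate_rows = 16_852
total_size_mib = 120.4

# Assumption: the missing-cell percentage is taken over all cells (rows x columns).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# The duplicate percentage is taken over all rows.
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Duplicate rows (%): {duplicate_pct:.1f}%")
print(f"Average record size: about {avg_record_bytes:.0f} B")
```

Because the total size is itself rounded to 0.1 MiB, the recomputed record size only agrees with the reported 312.0 B to the nearest byte.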

Variable types

Unsupported: 16
Categorical: 9
Text: 14

Alerts

Unnamed: 23 has constant value "0.0" (Constant)
Unnamed: 26 has constant value "0.0" (Constant)
Unnamed: 27 has constant value "4.0" (Constant)
Unnamed: 29 has constant value "av4" (Constant)
Unnamed: 30 has constant value "2.0" (Constant)
Unnamed: 31 has constant value "3o 5" (Constant)
Unnamed: 34 has constant value "Bihar " (Constant)
Unnamed: 37 has constant value "0.31" (Constant)
Unnamed: 38 has constant value "0.0" (Constant)
Dataset has 16852 (4.2%) duplicate rows (Duplicates)
Unnamed: 15 is highly overall correlated with Unnamed: 16 (High correlation)
Unnamed: 16 is highly overall correlated with Unnamed: 15 (High correlation)
Carrier is highly imbalanced (96.9%) (Imbalance)
Unnamed: 15 is highly imbalanced (89.1%) (Imbalance)
Unnamed: 16 is highly imbalanced (53.3%) (Imbalance)
Name has 52122 (12.9%) missing values (Missing)
Gender has 394673 (97.6%) missing values (Missing)
JobTitle has 345516 (85.4%) missing values (Missing)
CompanyName has 400061 (98.9%) missing values (Missing)
Email has 356314 (88.1%) missing values (Missing)
Facebook has 394619 (97.5%) missing values (Missing)
Twitter has 402495 (99.5%) missing values (Missing)
Unnamed: 10 has 402768 (99.6%) missing values (Missing)
Unnamed: 11 has 403444 (99.7%) missing values (Missing)
Unnamed: 12 has 393095 (97.2%) missing values (Missing)
Unnamed: 13 has 364169 (90.0%) missing values (Missing)
Unnamed: 14 has 357956 (88.5%) missing values (Missing)
Unnamed: 15 has 393841 (97.3%) missing values (Missing)
Unnamed: 16 has 403854 (99.8%) missing values (Missing)
Unnamed: 17 has 404277 (99.9%) missing values (Missing)
Unnamed: 18 has 404453 (> 99.9%) missing values (Missing)
Unnamed: 19 has 404539 (> 99.9%) missing values (Missing)
Unnamed: 20 has 404559 (> 99.9%) missing values (Missing)
Unnamed: 21 has 404560 (> 99.9%) missing values (Missing)
Unnamed: 22 has 404562 (> 99.9%) missing values (Missing)
Unnamed: 23 has 404563 (> 99.9%) missing values (Missing)
Unnamed: 24 has 404564 (> 99.9%) missing values (Missing)
Unnamed: 25 has 404563 (> 99.9%) missing values (Missing)
Unnamed: 26 has 404565 (> 99.9%) missing values (Missing)
Unnamed: 27 has 404565 (> 99.9%) missing values (Missing)
Unnamed: 28 has 404564 (> 99.9%) missing values (Missing)
Unnamed: 29 has 404565 (> 99.9%) missing values (Missing)
Unnamed: 30 has 404565 (> 99.9%) missing values (Missing)
Unnamed: 31 has 404565 (> 99.9%) missing values (Missing)
Unnamed: 32 has 404566 (100.0%) missing values (Missing)
Unnamed: 33 has 404566 (100.0%) missing values (Missing)
Unnamed: 34 has 404565 (> 99.9%) missing values (Missing)
Unnamed: 35 has 404566 (100.0%) missing values (Missing)
Unnamed: 36 has 404566 (100.0%) missing values (Missing)
Unnamed: 37 has 404564 (> 99.9%) missing values (Missing)
Unnamed: 38 has 404564 (> 99.9%) missing values (Missing)
Number is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 18 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 21 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 22 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 25 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 32 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 33 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 35 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)
Unnamed: 36 is an unsupported type, check if it needs cleaning or further analysis (Unsupported)

Reproduction

Analysis started: 2024-07-17 15:27:41.875874
Analysis finished: 2024-07-17 15:27:54.980402
Duration: 13.1 seconds
Software version: ydata-profiling v4.9.0
Download configuration: config.json
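
The reported duration is the difference between the two timestamps. A small Python sketch of that check, using only the values printed in this block:

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block of this report.
started = datetime.fromisoformat("2024-07-17 15:27:41.875874")
finished = datetime.fromisoformat("2024-07-17 15:27:54.980402")

# Elapsed wall-clock time between start and finish, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"Duration: {duration_seconds:.1f} seconds")
```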

Variables

Number
Unsupported

REJECTED  UNSUPPORTED 

Missing: 0
Missing (%): 0.0%
Memory size: 3.1 MiB

Carrier
Categorical

IMBALANCE 

Distinct: 5
Distinct (%): < 0.1%
Missing: 3
Missing (%): < 0.1%
Memory size: 3.1 MiB

Value    Count
Telenor  401345
Idea     2705
Airtel   509
BSNL     3
Carrier  1

Length

Max length: 7
Median length: 7
Mean length: 6.9786609
Min length: 4

Characters and Unicode

Total characters: 2823308
Distinct characters: 17
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
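
As an illustration of that statement, the character totals for Carrier can be rebuilt from its value counts with Python's standard `unicodedata` module. This is a sketch of the idea, not the profiler's own code; the dictionary below simply restates the counts shown for this variable.

```python
import unicodedata
from collections import Counter

# Value counts copied from the Carrier variable in this report.
value_counts = {"Telenor": 401345, "Idea": 2705, "Airtel": 509, "BSNL": 3, "Carrier": 1}

category_counts = Counter()
character_counts = Counter()
for value, count in value_counts.items():
    for char in value:
        # unicodedata.category returns a two-letter general category,
        # e.g. "Lu" (Uppercase Letter) or "Ll" (Lowercase Letter).
        category_counts[unicodedata.category(char)] += count
        character_counts[char] += count

total_characters = sum(category_counts.values())
print(total_characters)
print(dict(category_counts))
print(character_counts.most_common(3))
```

The totals this produces (2823308 characters, 2418736 lowercase, 404572 uppercase, 17 distinct characters) agree with the figures the report lists for Carrier.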

Unique

Unique: 1
Unique (%): < 0.1%

Sample

1st row: Telenor
2nd row: Telenor
3rd row: Telenor
4th row: Telenor
5th row: Telenor

Common Values

Value      Count   Frequency (%)
Telenor    401345  99.2%
Idea       2705    0.7%
Airtel     509     0.1%
BSNL       3       < 0.1%
Carrier    1       < 0.1%
(Missing)  3       < 0.1%
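
The report does not say how the 96.9% imbalance figure for Carrier is derived. One definition that reproduces it from the counts above is one minus the Shannon entropy of the class distribution, normalised by the log of the number of classes. The sketch below assumes that definition; the function name `imbalance_score` is illustrative.

```python
import math

# Non-missing value counts for Carrier, copied from this report.
counts = [401345, 2705, 509, 3, 1]

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (base 2)
    of the class distribution and k is the number of classes."""
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0  # a single class has zero entropy; treat as balanced
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(n_classes)

score = imbalance_score(counts)
print(f"Imbalance: {score:.1%}")
```

Under this definition a perfectly even split scores 0 and a variable dominated by one value approaches 1, which is consistent with Telenor accounting for 99.2% of rows.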

Length

Histogram of lengths of the category

Common Values (Plot)

Value    Count   Frequency (%)
telenor  401345  99.2%
idea     2705    0.7%
airtel   509     0.1%
bsnl     3       < 0.1%
carrier  1       < 0.1%

Most occurring characters

Value             Count   Frequency (%)
e                 805905  28.5%
r                 401857  14.2%
l                 401854  14.2%
T                 401345  14.2%
n                 401345  14.2%
o                 401345  14.2%
a                 2706    0.1%
d                 2705    0.1%
I                 2705    0.1%
i                 510     < 0.1%
Other values (7)  1031    < 0.1%

Most occurring categories

Value             Count    Frequency (%)
Lowercase Letter  2418736  85.7%
Uppercase Letter  404572   14.3%

Most frequent character per category

Lowercase Letter
Value  Count   Frequency (%)
e      805905  33.3%
r      401857  16.6%
l      401854  16.6%
n      401345  16.6%
o      401345  16.6%
a      2706    0.1%
d      2705    0.1%
i      510     < 0.1%
t      509     < 0.1%

Uppercase Letter
Value  Count   Frequency (%)
T      401345  99.2%
I      2705    0.7%
A      509     0.1%
B      3       < 0.1%
S      3       < 0.1%
N      3       < 0.1%
L      3       < 0.1%
C      1       < 0.1%

Most occurring scripts

Value  Count    Frequency (%)
Latin  2823308  100.0%

Most frequent character per script

Latin
Value             Count   Frequency (%)
e                 805905  28.5%
r                 401857  14.2%
l                 401854  14.2%
T                 401345  14.2%
n                 401345  14.2%
o                 401345  14.2%
a                 2706    0.1%
d                 2705    0.1%
I                 2705    0.1%
i                 510     < 0.1%
Other values (7)  1031    < 0.1%

Most occurring blocks

Value  Count    Frequency (%)
ASCII  2823308  100.0%

Most frequent character per block

ASCII
Value             Count   Frequency (%)
e                 805905  28.5%
r                 401857  14.2%
l                 401854  14.2%
T                 401345  14.2%
n                 401345  14.2%
o                 401345  14.2%
a                 2706    0.1%
d                 2705    0.1%
I                 2705    0.1%
i                 510     < 0.1%
Other values (7)  1031    < 0.1%

Name
Text

MISSING 

Distinct: 226584
Distinct (%): 64.3%
Missing: 52122
Missing (%): 12.9%
Memory size: 3.1 MiB

Length

Max length: 309
Median length: 124
Mean length: 10.476067
Min length: 1

Characters and Unicode

Total characters: 3692227
Distinct characters: 1122
Distinct categories: 22
Distinct scripts: 29
Distinct blocks: 50
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 204946
Unique (%): 58.1%
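
The distinct and unique percentages for Name appear to be taken over non-missing values rather than over all rows. The report does not state this, but that reading reproduces the printed figures, as does the mean length. A short Python sketch under that assumption:

```python
# Figures copied from the Name variable and the dataset overview.
n_observations = 404_566
missing = 52_122
distinct = 226_584
unique = 204_946
total_characters = 3_692_227

# Assumption: percentages and mean length are computed over non-missing rows only.
non_missing = n_observations - missing
distinct_pct = 100 * distinct / non_missing
unique_pct = 100 * unique / non_missing
mean_length = total_characters / non_missing

print(f"Distinct (%): {distinct_pct:.1f}%")
print(f"Unique (%): {unique_pct:.1f}%")
print(f"Mean length: {mean_length:.6f}")
```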

Sample

1st row: P N. R
2nd row: Rajat Arora
3rd row: Anil Giri Dalia Speciliest
4th row: Zameer
5th row: Naveen Naveen

Value                 Count   Frequency (%)
kumar                 24102   3.6%
singh                 8564    1.3%
2                     7200    1.1%
raj                   5900    0.9%
yadav                 5015    0.8%
k                     4138    0.6%
s                     3968    0.6%
rahul                 3899    0.6%
khan                  3790    0.6%
ji                    3301    0.5%
Other values (99922)  597994  89.5%

Most occurring characters

ValueCountFrequency (%)
a 544659
 
14.8%
315735
 
8.6%
i 240983
 
6.5%
h 206630
 
5.6%
n 206139
 
5.6%
r 186429
 
5.0%
u 162787
 
4.4%
e 134275
 
3.6%
s 107807
 
2.9%
m 105488
 
2.9%
Other values (1112) 1481295
40.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2618249
70.9%
Uppercase Letter 636164
 
17.2%
Space Separator 315741
 
8.6%
Decimal Number 58523
 
1.6%
Other Letter 24334
 
0.7%
Other Punctuation 22720
 
0.6%
Spacing Mark 8215
 
0.2%
Nonspacing Mark 5258
 
0.1%
Dash Punctuation 1395
 
< 0.1%
Other Symbol 710
 
< 0.1%
Other values (12) 918
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2780
 
11.4%
1745
 
7.2%
1710
 
7.0%
1317
 
5.4%
1315
 
5.4%
1141
 
4.7%
1067
 
4.4%
974
 
4.0%
932
 
3.8%
847
 
3.5%
Other values (309) 10506
43.2%
Lowercase Letter
ValueCountFrequency (%)
a 544659
20.8%
i 240983
 
9.2%
h 206630
 
7.9%
n 206139
 
7.9%
r 186429
 
7.1%
u 162787
 
6.2%
e 134275
 
5.1%
s 107807
 
4.1%
m 105488
 
4.0%
t 97084
 
3.7%
Other values (261) 625968
23.9%
Other Symbol
ValueCountFrequency (%)
61
 
8.6%
36
 
5.1%
😍 27
 
3.8%
26
 
3.7%
® 25
 
3.5%
😎 25
 
3.5%
😘 23
 
3.2%
😊 21
 
3.0%
20
 
2.8%
🙏 17
 
2.4%
Other values (159) 429
60.4%
Uppercase Letter
ValueCountFrequency (%)
S 101250
15.9%
K 63657
10.0%
A 61960
9.7%
R 61035
9.6%
M 51572
 
8.1%
P 42007
 
6.6%
B 35673
 
5.6%
D 29483
 
4.6%
N 27372
 
4.3%
G 24314
 
3.8%
Other values (125) 137841
21.7%
Nonspacing Mark
ValueCountFrequency (%)
1318
25.1%
928
17.6%
848
16.1%
621
11.8%
538
10.2%
230
 
4.4%
152
 
2.9%
ಿ 120
 
2.3%
64
 
1.2%
62
 
1.2%
Other values (54) 377
 
7.2%
Spacing Mark
ValueCountFrequency (%)
3792
46.2%
1592
19.4%
ि 1450
 
17.7%
683
 
8.3%
137
 
1.7%
118
 
1.4%
93
 
1.1%
57
 
0.7%
43
 
0.5%
35
 
0.4%
Other values (31) 215
 
2.6%
Other Punctuation
ValueCountFrequency (%)
. 20205
88.9%
' 685
 
3.0%
@ 485
 
2.1%
* 331
 
1.5%
& 239
 
1.1%
# 216
 
1.0%
: 156
 
0.7%
? 101
 
0.4%
! 96
 
0.4%
55
 
0.2%
Other values (17) 151
 
0.7%
Decimal Number
ValueCountFrequency (%)
2 13588
23.2%
0 9421
16.1%
1 7698
13.2%
3 4828
 
8.2%
9 4438
 
7.6%
4 4427
 
7.6%
7 4350
 
7.4%
5 3333
 
5.7%
6 3272
 
5.6%
8 3119
 
5.3%
Other values (16) 49
 
0.1%
Modifier Letter
ValueCountFrequency (%)
7
25.0%
ᴿ 3
 
10.7%
2
 
7.1%
2
 
7.1%
ʳ 1
 
3.6%
1
 
3.6%
ˢ 1
 
3.6%
1
 
3.6%
ˡ 1
 
3.6%
1
 
3.6%
Other values (8) 8
28.6%
Currency Symbol
ValueCountFrequency (%)
$ 93
39.9%
40
17.2%
£ 28
 
12.0%
¥ 25
 
10.7%
¤ 17
 
7.3%
16
 
6.9%
¢ 6
 
2.6%
4
 
1.7%
1
 
0.4%
1
 
0.4%
Other values (2) 2
 
0.9%
Math Symbol
ValueCountFrequency (%)
| 36
26.7%
29
21.5%
+ 19
14.1%
~ 17
12.6%
× 12
 
8.9%
8
 
5.9%
5
 
3.7%
÷ 5
 
3.7%
2
 
1.5%
1
 
0.7%
Format
ValueCountFrequency (%)
15
46.9%
8
25.0%
4
 
12.5%
2
 
6.2%
2
 
6.2%
1
 
3.1%
Open Punctuation
ValueCountFrequency (%)
( 150
89.8%
{ 11
 
6.6%
5
 
3.0%
1
 
0.6%
Other Number
ValueCountFrequency (%)
² 8
72.7%
¹ 1
 
9.1%
1
 
9.1%
³ 1
 
9.1%
Modifier Symbol
ValueCountFrequency (%)
^ 7
46.7%
🏻 4
26.7%
` 3
20.0%
´ 1
 
6.7%
Close Punctuation
ValueCountFrequency (%)
) 153
90.0%
} 10
 
5.9%
7
 
4.1%
Space Separator
ValueCountFrequency (%)
315735
> 99.9%
  6
 
< 0.1%
Final Punctuation
ValueCountFrequency (%)
» 4
50.0%
4
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 1395
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 114
100.0%
Initial Punctuation
ValueCountFrequency (%)
« 4
100.0%
Control
ValueCountFrequency (%)
‘ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3253425
88.1%
Common 399945
 
10.8%
Devanagari 33835
 
0.9%
Kannada 1982
 
0.1%
Arabic 1202
 
< 0.1%
Cyrillic 463
 
< 0.1%
Greek 434
 
< 0.1%
Inherited 227
 
< 0.1%
Gurmukhi 184
 
< 0.1%
Bengali 167
 
< 0.1%
Other values (19) 363
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 544659
16.7%
i 240983
 
7.4%
h 206630
 
6.4%
n 206139
 
6.3%
r 186429
 
5.7%
u 162787
 
5.0%
e 134275
 
4.1%
s 107807
 
3.3%
m 105488
 
3.2%
S 101250
 
3.1%
Other values (295) 1256978
38.6%
Common
ValueCountFrequency (%)
315735
78.9%
. 20205
 
5.1%
2 13588
 
3.4%
0 9421
 
2.4%
1 7698
 
1.9%
3 4828
 
1.2%
9 4438
 
1.1%
4 4427
 
1.1%
7 4350
 
1.1%
5 3333
 
0.8%
Other values (250) 11922
 
3.0%
Devanagari
ValueCountFrequency (%)
3792
 
11.2%
2780
 
8.2%
1745
 
5.2%
1710
 
5.1%
1592
 
4.7%
ि 1450
 
4.3%
1318
 
3.9%
1317
 
3.9%
1315
 
3.9%
1141
 
3.4%
Other values (77) 15675
46.3%
Kannada
ValueCountFrequency (%)
230
 
11.6%
165
 
8.3%
ಿ 120
 
6.1%
118
 
6.0%
93
 
4.7%
93
 
4.7%
81
 
4.1%
64
 
3.2%
62
 
3.1%
59
 
3.0%
Other values (43) 897
45.3%
Arabic
ValueCountFrequency (%)
ا 181
15.1%
م 110
 
9.2%
ی 93
 
7.7%
ن 80
 
6.7%
ر 78
 
6.5%
د 61
 
5.1%
ب 60
 
5.0%
و 50
 
4.2%
ح 46
 
3.8%
س 41
 
3.4%
Other values (40) 402
33.4%
Cyrillic
ValueCountFrequency (%)
н 51
 
11.0%
є 45
 
9.7%
и 41
 
8.9%
я 33
 
7.1%
т 32
 
6.9%
ѕ 32
 
6.9%
Ѕ 32
 
6.9%
м 28
 
6.0%
В 18
 
3.9%
к 18
 
3.9%
Other values (37) 133
28.7%
Gurmukhi
ValueCountFrequency (%)
20
 
10.9%
13
 
7.1%
11
 
6.0%
11
 
6.0%
10
 
5.4%
9
 
4.9%
8
 
4.3%
ਿ 8
 
4.3%
8
 
4.3%
7
 
3.8%
Other values (30) 79
42.9%
Bengali
ValueCountFrequency (%)
28
16.8%
10
 
6.0%
10
 
6.0%
10
 
6.0%
ি 9
 
5.4%
9
 
5.4%
9
 
5.4%
7
 
4.2%
6
 
3.6%
6
 
3.6%
Other values (28) 63
37.7%
Greek
ValueCountFrequency (%)
α 185
42.6%
ι 64
 
14.7%
υ 42
 
9.7%
σ 36
 
8.3%
Α 20
 
4.6%
ε 10
 
2.3%
ω 10
 
2.3%
ν 9
 
2.1%
δ 8
 
1.8%
π 6
 
1.4%
Other values (19) 44
 
10.1%
Gujarati
ValueCountFrequency (%)
5
 
10.4%
5
 
10.4%
4
 
8.3%
3
 
6.2%
2
 
4.2%
2
 
4.2%
2
 
4.2%
2
 
4.2%
2
 
4.2%
2
 
4.2%
Other values (19) 19
39.6%
Han
ValueCountFrequency (%)
3
 
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (15) 15
46.9%
Tamil
ValueCountFrequency (%)
9
16.4%
5
 
9.1%
5
 
9.1%
4
 
7.3%
4
 
7.3%
4
 
7.3%
3
 
5.5%
3
 
5.5%
ி 3
 
5.5%
2
 
3.6%
Other values (11) 13
23.6%
Telugu
ValueCountFrequency (%)
9
18.4%
5
 
10.2%
4
 
8.2%
3
 
6.1%
3
 
6.1%
3
 
6.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
Other values (11) 14
28.6%
Inherited
ValueCountFrequency (%)
62
27.3%
̤ 49
21.6%
̈ 22
 
9.7%
15
 
6.6%
̸ 14
 
6.2%
̃ 13
 
5.7%
̲ 11
 
4.8%
̺ 11
 
4.8%
̴ 10
 
4.4%
͙ 5
 
2.2%
Other values (9) 15
 
6.6%
Armenian
ValueCountFrequency (%)
հ 8
16.7%
մ 6
12.5%
Տ 5
10.4%
օ 4
8.3%
ռ 4
8.3%
ղ 3
 
6.2%
ք 3
 
6.2%
տ 2
 
4.2%
ա 2
 
4.2%
ժ 2
 
4.2%
Other values (8) 9
18.8%
Georgian
ValueCountFrequency (%)
7
26.9%
4
15.4%
4
15.4%
3
11.5%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Other values (2) 2
 
7.7%
Thai
ValueCountFrequency (%)
5
23.8%
4
19.0%
2
 
9.5%
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Other values (2) 2
 
9.5%
Tibetan
ValueCountFrequency (%)
14
53.8%
3
 
11.5%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Ol_Chiki
ValueCountFrequency (%)
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Oriya
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Malayalam
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Cherokee
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Bopomofo
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Myanmar
ValueCountFrequency (%)
3
60.0%
2
40.0%
Hiragana
ValueCountFrequency (%)
2
66.7%
1
33.3%
Hebrew
ValueCountFrequency (%)
נ 2
66.7%
ק 1
33.3%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%
Canadian_Aboriginal
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3648639
98.8%
Devanagari 33894
 
0.9%
None 4079
 
0.1%
Kannada 1982
 
0.1%
Arabic 1210
 
< 0.1%
Cyrillic 463
 
< 0.1%
IPA Ext 289
 
< 0.1%
Emoticons 190
 
< 0.1%
Gurmukhi 184
 
< 0.1%
Bengali 167
 
< 0.1%
Other values (40) 1130
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 544659
 
14.9%
315735
 
8.7%
i 240983
 
6.6%
h 206630
 
5.7%
n 206139
 
5.6%
r 186429
 
5.1%
u 162787
 
4.5%
e 134275
 
3.7%
s 107807
 
3.0%
m 105488
 
2.9%
Other values (79) 1437707
39.4%
Devanagari
ValueCountFrequency (%)
3792
 
11.2%
2780
 
8.2%
1745
 
5.1%
1710
 
5.0%
1592
 
4.7%
ि 1450
 
4.3%
1318
 
3.9%
1317
 
3.9%
1315
 
3.9%
1141
 
3.4%
Other values (80) 15734
46.4%
Kannada
ValueCountFrequency (%)
230
 
11.6%
165
 
8.3%
ಿ 120
 
6.1%
118
 
6.0%
93
 
4.7%
93
 
4.7%
81
 
4.1%
64
 
3.2%
62
 
3.1%
59
 
3.0%
Other values (43) 897
45.3%
None
ValueCountFrequency (%)
ñ 219
 
5.4%
α 185
 
4.5%
ã 184
 
4.5%
à 152
 
3.7%
ø 134
 
3.3%
â 118
 
2.9%
í 93
 
2.3%
î 93
 
2.3%
á 92
 
2.3%
ß 91
 
2.2%
Other values (289) 2718
66.6%
Arabic
ValueCountFrequency (%)
ا 181
15.0%
م 110
 
9.1%
ی 93
 
7.7%
ن 80
 
6.6%
ر 78
 
6.4%
د 61
 
5.0%
ب 60
 
5.0%
و 50
 
4.1%
ح 46
 
3.8%
س 41
 
3.4%
Other values (46) 410
33.9%
VS
ValueCountFrequency (%)
62
100.0%
Dingbats
ValueCountFrequency (%)
61
73.5%
5
 
6.0%
4
 
4.8%
3
 
3.6%
3
 
3.6%
2
 
2.4%
2
 
2.4%
1
 
1.2%
1
 
1.2%
1
 
1.2%
Cyrillic
ValueCountFrequency (%)
н 51
 
11.0%
є 45
 
9.7%
и 41
 
8.9%
я 33
 
7.1%
т 32
 
6.9%
ѕ 32
 
6.9%
Ѕ 32
 
6.9%
м 28
 
6.0%
В 18
 
3.9%
к 18
 
3.9%
Other values (37) 133
28.7%
Diacriticals
ValueCountFrequency (%)
̤ 49
34.5%
̈ 22
15.5%
̸ 14
 
9.9%
̃ 13
 
9.2%
̲ 11
 
7.7%
̺ 11
 
7.7%
̴ 10
 
7.0%
͙ 5
 
3.5%
̫ 4
 
2.8%
̊ 3
 
2.1%
Currency Symbols
ValueCountFrequency (%)
40
63.5%
16
 
25.4%
4
 
6.3%
1
 
1.6%
1
 
1.6%
1
 
1.6%
Misc Symbols
ValueCountFrequency (%)
36
31.0%
20
17.2%
10
 
8.6%
6
 
5.2%
6
 
5.2%
5
 
4.3%
4
 
3.4%
4
 
3.4%
4
 
3.4%
4
 
3.4%
Other values (12) 17
14.7%
IPA Ext
ValueCountFrequency (%)
ʌ 32
 
11.1%
ʜ 21
 
7.3%
ʀ 21
 
7.3%
ʝ 18
 
6.2%
ə 14
 
4.8%
ɩ 13
 
4.5%
ʋ 12
 
4.2%
ɦ 11
 
3.8%
ɾ 11
 
3.8%
ɭ 9
 
3.1%
Other values (37) 127
43.9%
Math Operators
ValueCountFrequency (%)
29
64.4%
8
 
17.8%
5
 
11.1%
2
 
4.4%
1
 
2.2%
Bengali
ValueCountFrequency (%)
28
16.8%
10
 
6.0%
10
 
6.0%
10
 
6.0%
ি 9
 
5.4%
9
 
5.4%
9
 
5.4%
7
 
4.2%
6
 
3.6%
6
 
3.6%
Other values (28) 63
37.7%
Letterlike Symbols
ValueCountFrequency (%)
28
71.8%
5
 
12.8%
2
 
5.1%
1
 
2.6%
1
 
2.6%
1
 
2.6%
1
 
2.6%
Emoticons
ValueCountFrequency (%)
😍 27
14.2%
😎 25
13.2%
😘 23
12.1%
😊 21
11.1%
🙏 17
8.9%
😂 13
6.8%
😉 13
6.8%
😁 11
 
5.8%
😄 7
 
3.7%
😋 4
 
2.1%
Other values (14) 29
15.3%
Specials
ValueCountFrequency (%)
26
100.0%
Gurmukhi
ValueCountFrequency (%)
20
 
10.9%
13
 
7.1%
11
 
6.0%
11
 
6.0%
10
 
5.4%
9
 
4.9%
8
 
4.3%
ਿ 8
 
4.3%
8
 
4.3%
7
 
3.8%
Other values (30) 79
42.9%
Enclosed Alphanum Sup
ValueCountFrequency (%)
🇳 16
31.4%
🇮 14
27.5%
🅰 3
 
5.9%
🇭 2
 
3.9%
🇦 2
 
3.9%
🅺 2
 
3.9%
🇪 1
 
2.0%
🅱 1
 
2.0%
🇸 1
 
2.0%
🇷 1
 
2.0%
Other values (8) 8
15.7%
Punctuation
ValueCountFrequency (%)
15
34.9%
8
18.6%
4
 
9.3%
4
 
9.3%
4
 
9.3%
3
 
7.0%
2
 
4.7%
2
 
4.7%
1
 
2.3%
Tibetan
ValueCountFrequency (%)
14
53.8%
3
 
11.5%
2
 
7.7%
2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Phonetic Ext
ValueCountFrequency (%)
9
18.4%
7
14.3%
ᴿ 3
 
6.1%
3
 
6.1%
3
 
6.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
Other values (13) 14
28.6%
Tamil
ValueCountFrequency (%)
9
16.4%
5
 
9.1%
5
 
9.1%
4
 
7.3%
4
 
7.3%
4
 
7.3%
3
 
5.5%
3
 
5.5%
ி 3
 
5.5%
2
 
3.6%
Other values (11) 13
23.6%
Telugu
ValueCountFrequency (%)
9
18.4%
5
 
10.2%
4
 
8.2%
3
 
6.1%
3
 
6.1%
3
 
6.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
2
 
4.1%
Other values (11) 14
28.6%
Armenian
ValueCountFrequency (%)
հ 8
16.7%
մ 6
12.5%
Տ 5
10.4%
օ 4
8.3%
ռ 4
8.3%
ղ 3
 
6.2%
ք 3
 
6.2%
տ 2
 
4.2%
ա 2
 
4.2%
ժ 2
 
4.2%
Other values (8) 9
18.8%
Georgian Sup
ValueCountFrequency (%)
7
58.3%
4
33.3%
1
 
8.3%
Geometric Shapes
ValueCountFrequency (%)
6
46.2%
3
23.1%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Gujarati
ValueCountFrequency (%)
5
 
10.4%
5
 
10.4%
4
 
8.3%
3
 
6.2%
2
 
4.2%
2
 
4.2%
2
 
4.2%
2
 
4.2%
2
 
4.2%
2
 
4.2%
Other values (19) 19
39.6%
Thai
ValueCountFrequency (%)
5
23.8%
4
19.0%
2
 
9.5%
2
 
9.5%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
1
 
4.8%
Other values (2) 2
 
9.5%
Enclosed Alphanum
ValueCountFrequency (%)
4
22.2%
3
16.7%
2
11.1%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
1
 
5.6%
Other values (2) 2
11.1%
Georgian
ValueCountFrequency (%)
4
28.6%
3
21.4%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Greek Ext
ValueCountFrequency (%)
3
100.0%
CJK
ValueCountFrequency (%)
3
 
9.4%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (15) 15
46.9%
UCAS
ValueCountFrequency (%)
3
100.0%
Myanmar
ValueCountFrequency (%)
3
60.0%
2
40.0%
Bopomofo
ValueCountFrequency (%)
2
50.0%
1
25.0%
1
25.0%
Malayalam
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Ol Chiki
ValueCountFrequency (%)
2
22.2%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Hiragana
ValueCountFrequency (%)
2
66.7%
1
33.3%
Cherokee
ValueCountFrequency (%)
2
25.0%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Hebrew
ValueCountFrequency (%)
נ 2
66.7%
ק 1
33.3%
Katakana
ValueCountFrequency (%)
1
50.0%
1
50.0%
Number Forms
ValueCountFrequency (%)
1
100.0%
Latin Ext Additional
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Modifier Letters
ValueCountFrequency (%)
ʳ 1
20.0%
ˢ 1
20.0%
ˡ 1
20.0%
ʰ 1
20.0%
ˊ 1
20.0%
Hangul
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Enclosed Ideographic Sup
ValueCountFrequency (%)
🈂 1
100.0%
Small Forms
ValueCountFrequency (%)
1
100.0%
Oriya
ValueCountFrequency (%)
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
1
12.5%
Box Drawing
ValueCountFrequency (%)
1
50.0%
1
50.0%

Gender
Text

MISSING 

Distinct: 449
Distinct (%): 4.5%
Missing: 394673
Missing (%): 97.6%
Memory size: 3.1 MiB

Length

Max length: 36
Median length: 4
Mean length: 4.3008188
Min length: 1

Characters and Unicode

Total characters: 42548
Distinct characters: 122
Distinct categories: 14
Distinct scripts: 5
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 391
Unique (%): 4.0%

Sample

1st row: MALE
2nd row: MALE
3rd row: MALE
4th row: MALE
5th row: MALE
ValueCountFrequency (%)
male 8145
81.4%
female 1086
 
10.9%
v 39
 
0.4%
i 37
 
0.4%
2.00 21
 
0.2%
ma 16
 
0.2%
k 14
 
0.1%
12
 
0.1%
s 11
 
0.1%
p 11
 
0.1%
Other values (466) 609
 
6.1%

Most occurring characters

ValueCountFrequency (%)
E 10321
24.3%
M 9290
21.8%
A 9252
21.7%
L 9237
21.7%
F 1093
 
2.6%
551
 
1.3%
a 386
 
0.9%
r 176
 
0.4%
i 146
 
0.3%
n 127
 
0.3%
Other values (112) 1969
 
4.6%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter 39599
93.1%
Lowercase Letter 1981
 
4.7%
Space Separator 551
 
1.3%
Decimal Number 164
 
0.4%
Other Punctuation 115
 
0.3%
Other Letter 78
 
0.2%
Spacing Mark 23
 
0.1%
Nonspacing Mark 20
 
< 0.1%
Other Symbol 9
 
< 0.1%
Format 3
 
< 0.1%
Other values (4) 5
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
12
15.4%
10
12.8%
6
 
7.7%
5
 
6.4%
5
 
6.4%
4
 
5.1%
3
 
3.8%
3
 
3.8%
3
 
3.8%
3
 
3.8%
Other values (18) 24
30.8%
Uppercase Letter
ValueCountFrequency (%)
E 10321
26.1%
M 9290
23.5%
A 9252
23.4%
L 9237
23.3%
F 1093
 
2.8%
S 49
 
0.1%
B 46
 
0.1%
I 45
 
0.1%
V 41
 
0.1%
P 38
 
0.1%
Other values (16) 187
 
0.5%
Lowercase Letter
ValueCountFrequency (%)
a 386
19.5%
r 176
 
8.9%
i 146
 
7.4%
n 127
 
6.4%
h 121
 
6.1%
e 100
 
5.0%
u 94
 
4.7%
t 91
 
4.6%
o 87
 
4.4%
s 83
 
4.2%
Other values (16) 570
28.8%
Decimal Number
ValueCountFrequency (%)
0 88
53.7%
2 29
 
17.7%
1 13
 
7.9%
7 7
 
4.3%
5 6
 
3.7%
8 5
 
3.0%
9 5
 
3.0%
3 4
 
2.4%
6 4
 
2.4%
4 3
 
1.8%
Other Punctuation
ValueCountFrequency (%)
. 101
87.8%
' 5
 
4.3%
& 2
 
1.7%
; 2
 
1.7%
? 2
 
1.7%
1
 
0.9%
# 1
 
0.9%
* 1
 
0.9%
Spacing Mark
ValueCountFrequency (%)
11
47.8%
ि 4
 
17.4%
4
 
17.4%
2
 
8.7%
1
 
4.3%
1
 
4.3%
Nonspacing Mark
ValueCountFrequency (%)
6
30.0%
4
20.0%
4
20.0%
3
15.0%
2
 
10.0%
1
 
5.0%
Other Symbol
ValueCountFrequency (%)
😆 5
55.6%
😎 1
 
11.1%
🐯 1
 
11.1%
💪 1
 
11.1%
😊 1
 
11.1%
Math Symbol
ValueCountFrequency (%)
× 1
50.0%
÷ 1
50.0%
Space Separator
ValueCountFrequency (%)
551
100.0%
Format
ValueCountFrequency (%)
3
100.0%
Open Punctuation
ValueCountFrequency (%)
( 1
100.0%
Close Punctuation
ValueCountFrequency (%)
) 1
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 41580
97.7%
Common 844
 
2.0%
Devanagari 116
 
0.3%
Bengali 5
 
< 0.1%
Inherited 3
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
E 10321
24.8%
M 9290
22.3%
A 9252
22.3%
L 9237
22.2%
F 1093
 
2.6%
a 386
 
0.9%
r 176
 
0.4%
i 146
 
0.4%
n 127
 
0.3%
h 121
 
0.3%
Other values (42) 1431
 
3.4%
Devanagari
ValueCountFrequency (%)
12
 
10.3%
11
 
9.5%
10
 
8.6%
6
 
5.2%
6
 
5.2%
5
 
4.3%
5
 
4.3%
4
 
3.4%
ि 4
 
3.4%
4
 
3.4%
Other values (25) 49
42.2%
Common
ValueCountFrequency (%)
551
65.3%
. 101
 
12.0%
0 88
 
10.4%
2 29
 
3.4%
1 13
 
1.5%
7 7
 
0.8%
5 6
 
0.7%
8 5
 
0.6%
9 5
 
0.6%
😆 5
 
0.6%
Other values (19) 34
 
4.0%
Bengali
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Inherited
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 42412
99.7%
Devanagari 117
 
0.3%
Emoticons 7
 
< 0.1%
Bengali 5
 
< 0.1%
None 4
 
< 0.1%
Punctuation 3
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
E 10321
24.3%
M 9290
21.9%
A 9252
21.8%
L 9237
21.8%
F 1093
 
2.6%
551
 
1.3%
a 386
 
0.9%
r 176
 
0.4%
i 146
 
0.3%
n 127
 
0.3%
Other values (63) 1833
 
4.3%
Devanagari
ValueCountFrequency (%)
12
 
10.3%
11
 
9.4%
10
 
8.5%
6
 
5.1%
6
 
5.1%
5
 
4.3%
5
 
4.3%
4
 
3.4%
ि 4
 
3.4%
4
 
3.4%
Other values (26) 50
42.7%
Emoticons
ValueCountFrequency (%)
😆 5
71.4%
😎 1
 
14.3%
😊 1
 
14.3%
Punctuation
ValueCountFrequency (%)
3
100.0%
None
ValueCountFrequency (%)
× 1
25.0%
🐯 1
25.0%
💪 1
25.0%
÷ 1
25.0%
Bengali
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Distinct: 2278
Distinct (%): 0.6%
Missing: 690
Missing (%): 0.2%
Memory size: 3.1 MiB

Length

Max length: 107
Median length: 104
Mean length: 10.24564
Min length: 1

Characters and Unicode

Total characters: 4137968
Distinct characters: 202
Distinct categories: 16
Distinct scripts: 8
Distinct blocks: 12
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1992
Unique (%): 0.5%

Sample

1st row: Bihar
2nd row: Bihar in
3rd row: Bihar
4th row: Bihar
5th row: Bihar in

Value                Count   Frequency (%)
bihar                234197  32.3%
pradesh              111261  15.4%
uttar                111260  15.4%
in                   94184   13.0%
east                 69144   9.5%
west                 42097   5.8%
bundi                19279   2.7%
goharganj            11377   1.6%
gairatganj           3805    0.5%
chhabra              3731    0.5%
Other values (2588)  23843   3.3%

Most occurring characters

ValueCountFrequency (%)
a 601331
14.5%
573846
13.9%
r 492680
11.9%
h 369631
8.9%
i 367399
8.9%
t 341486
8.3%
B 260663
6.3%
s 223656
 
5.4%
e 156126
 
3.8%
n 142639
 
3.4%
Other values (192) 608511
14.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2933407
70.9%
Uppercase Letter 628738
 
15.2%
Space Separator 573846
 
13.9%
Decimal Number 1267
 
< 0.1%
Other Letter 254
 
< 0.1%
Other Punctuation 218
 
< 0.1%
Spacing Mark 62
 
< 0.1%
Nonspacing Mark 47
 
< 0.1%
Dash Punctuation 37
 
< 0.1%
Open Punctuation 28
 
< 0.1%
Other values (6) 64
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
9.8%
14
 
5.5%
13
 
5.1%
12
 
4.7%
12
 
4.7%
12
 
4.7%
10
 
3.9%
9
 
3.5%
9
 
3.5%
ر 8
 
3.1%
Other values (54) 130
51.2%
Lowercase Letter
ValueCountFrequency (%)
a 601331
20.5%
r 492680
16.8%
h 369631
12.6%
i 367399
12.5%
t 341486
11.6%
s 223656
 
7.6%
e 156126
 
5.3%
n 142639
 
4.9%
d 139951
 
4.8%
u 25600
 
0.9%
Other values (30) 72908
 
2.5%
Uppercase Letter
ValueCountFrequency (%)
B 260663
41.5%
P 111767
17.8%
U 111331
17.7%
E 69213
 
11.0%
W 42116
 
6.7%
G 15671
 
2.5%
C 5001
 
0.8%
A 2768
 
0.4%
K 2708
 
0.4%
D 2682
 
0.4%
Other values (15) 4818
 
0.8%
Other Symbol
ValueCountFrequency (%)
😊 3
 
12.5%
🙅 2
 
8.3%
😎 2
 
8.3%
💪 1
 
4.2%
🔫 1
 
4.2%
😂 1
 
4.2%
🐯 1
 
4.2%
🚨 1
 
4.2%
🐆 1
 
4.2%
👈 1
 
4.2%
Other values (10) 10
41.7%
Other Punctuation
ValueCountFrequency (%)
. 157
72.0%
# 16
 
7.3%
/ 15
 
6.9%
? 9
 
4.1%
: 7
 
3.2%
' 3
 
1.4%
* 3
 
1.4%
& 3
 
1.4%
! 2
 
0.9%
1
 
0.5%
Other values (2) 2
 
0.9%
Decimal Number
ValueCountFrequency (%)
0 257
20.3%
1 206
16.3%
2 156
12.3%
5 144
11.4%
8 116
9.2%
3 116
9.2%
4 85
 
6.7%
6 81
 
6.4%
7 59
 
4.7%
9 47
 
3.7%
Nonspacing Mark
ValueCountFrequency (%)
15
31.9%
7
14.9%
6
 
12.8%
6
 
12.8%
5
 
10.6%
2
 
4.3%
2
 
4.3%
2
 
4.3%
1
 
2.1%
1
 
2.1%
Spacing Mark
ValueCountFrequency (%)
28
45.2%
11
 
17.7%
9
 
14.5%
ि 4
 
6.5%
3
 
4.8%
3
 
4.8%
2
 
3.2%
1
 
1.6%
1
 
1.6%
Open Punctuation
ValueCountFrequency (%)
( 26
92.9%
1
 
3.6%
{ 1
 
3.6%
Close Punctuation
ValueCountFrequency (%)
) 26
96.3%
} 1
 
3.7%
Math Symbol
ValueCountFrequency (%)
+ 9
90.0%
1
 
10.0%
Space Separator
ValueCountFrequency (%)
573846
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 37
100.0%
Currency Symbol
ValueCountFrequency (%)
¢ 1
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 1
100.0%
Control
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 3562106
86.1%
Common 575460
 
13.9%
Devanagari 278
 
< 0.1%
Arabic 44
 
< 0.1%
Kannada 40
 
< 0.1%
Cyrillic 21
 
< 0.1%
Greek 18
 
< 0.1%
Inherited 1
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
573846
99.7%
0 257
 
< 0.1%
1 206
 
< 0.1%
. 157
 
< 0.1%
2 156
 
< 0.1%
5 144
 
< 0.1%
8 116
 
< 0.1%
3 116
 
< 0.1%
4 85
 
< 0.1%
6 81
 
< 0.1%
Other values (44) 296
 
0.1%
Latin
ValueCountFrequency (%)
a 601331
16.9%
r 492680
13.8%
h 369631
10.4%
i 367399
10.3%
t 341486
9.6%
B 260663
7.3%
s 223656
 
6.3%
e 156126
 
4.4%
n 142639
 
4.0%
d 139951
 
3.9%
Other values (43) 466544
13.1%
Devanagari
ValueCountFrequency (%)
28
 
10.1%
25
 
9.0%
15
 
5.4%
14
 
5.0%
13
 
4.7%
12
 
4.3%
12
 
4.3%
12
 
4.3%
11
 
4.0%
10
 
3.6%
Other values (38) 126
45.3%
Kannada
ValueCountFrequency (%)
6
15.0%
5
12.5%
4
10.0%
4
10.0%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
1
 
2.5%
Other values (9) 9
22.5%
Arabic
ValueCountFrequency (%)
ر 8
18.2%
م 5
11.4%
ش 5
11.4%
ی 5
11.4%
و 3
 
6.8%
ہ 3
 
6.8%
ف 2
 
4.5%
ا 2
 
4.5%
ب 2
 
4.5%
ج 2
 
4.5%
Other values (5) 7
15.9%
Cyrillic
ValueCountFrequency (%)
я 4
19.0%
ѕ 4
19.0%
и 4
19.0%
т 3
14.3%
к 2
9.5%
м 2
9.5%
є 1
 
4.8%
н 1
 
4.8%
Greek
ValueCountFrequency (%)
α 9
50.0%
ι 4
22.2%
ρ 3
 
16.7%
υ 2
 
11.1%
Inherited
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4137536
> 99.9%
Devanagari 279
 
< 0.1%
Arabic 44
 
< 0.1%
Kannada 40
 
< 0.1%
None 31
 
< 0.1%
Cyrillic 21
 
< 0.1%
Emoticons 12
 
< 0.1%
Punctuation 1
 
< 0.1%
Misc Symbols 1
 
< 0.1%
VS 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 601331
14.5%
573846
13.9%
r 492680
11.9%
h 369631
8.9%
i 367399
8.9%
t 341486
8.3%
B 260663
6.3%
s 223656
 
5.4%
e 156126
 
3.8%
n 142639
 
3.4%
Other values (71) 608079
14.7%
Devanagari
ValueCountFrequency (%)
28
 
10.0%
25
 
9.0%
15
 
5.4%
14
 
5.0%
13
 
4.7%
12
 
4.3%
12
 
4.3%
12
 
4.3%
11
 
3.9%
10
 
3.6%
Other values (39) 127
45.5%
None
ValueCountFrequency (%)
α 9
29.0%
ι 4
12.9%
ρ 3
 
9.7%
υ 2
 
6.5%
💪 1
 
3.2%
🔫 1
 
3.2%
🐯 1
 
3.2%
🚨 1
 
3.2%
🐆 1
 
3.2%
¢ 1
 
3.2%
Other values (7) 7
22.6%
Arabic
ValueCountFrequency (%)
ر 8
18.2%
م 5
11.4%
ش 5
11.4%
ی 5
11.4%
و 3
 
6.8%
ہ 3
 
6.8%
ف 2
 
4.5%
ا 2
 
4.5%
ب 2
 
4.5%
ج 2
 
4.5%
Other values (5) 7
15.9%
Kannada
ValueCountFrequency (%)
6
15.0%
5
12.5%
4
10.0%
4
10.0%
3
 
7.5%
2
 
5.0%
2
 
5.0%
2
 
5.0%
2
 
5.0%
1
 
2.5%
Other values (9) 9
22.5%
Cyrillic
ValueCountFrequency (%)
я 4
19.0%
ѕ 4
19.0%
и 4
19.0%
т 3
14.3%
к 2
9.5%
м 2
9.5%
є 1
 
4.8%
н 1
 
4.8%
Emoticons
ValueCountFrequency (%)
😊 3
25.0%
🙅 2
16.7%
😎 2
16.7%
😂 1
 
8.3%
😘 1
 
8.3%
😀 1
 
8.3%
😉 1
 
8.3%
😏 1
 
8.3%
Punctuation
ValueCountFrequency (%)
1
100.0%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
VS
ValueCountFrequency (%)
1
100.0%
Arrows
ValueCountFrequency (%)
1
100.0%
Dingbats
ValueCountFrequency (%)
1
100.0%

JobTitle
Text

MISSING 

Distinct 2031
Distinct (%) 3.4%
Missing 345516
Missing (%) 85.4%
Memory size 3.1 MiB

Length

Max length 88
Median length 54
Mean length 13.14376
Min length 1

Characters and Unicode

Total characters 776139
Distinct characters 188
Distinct categories 17
Distinct scripts 7
Distinct blocks 15
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
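As a rough illustration of how such a per-category tally can be produced, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` and the three-value sample are assumptions made for this example, not the profiler's own code. The standard library reports two-letter codes (`Ll`, `Lu`, `Nd`, `Po`) rather than the long names shown in this report ("Lowercase Letter", "Uppercase Letter", and so on).

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character in an iterable of strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            # unicodedata.category returns a two-letter code such as 'Ll' or 'Nd'
            counts[unicodedata.category(ch)] += 1
    return counts


# Hypothetical sample: three short strings, not the full column.
sample = ["DARLING", "sector.10", "now"]
counts = category_counts(sample)
total = sum(counts.values())

# Print each category with its count and share of all characters.
for cat, n in counts.most_common():
    print(f"{cat}: {n} ({n / total:.1%})")
```

The `unicodedata` module does not expose the script or block properties, so the script and block tables would need a different data source.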

Unique

Unique 1636
Unique (%) 2.8%

Sample

1st row in
2nd row centar
3rd row now
4th row DARLING
5th row sector.10
ValueCountFrequency (%)
rajasthan 29881
31.1%
pradesh 23540
24.5%
madhya 23286
24.2%
in 12970
13.5%
india 860
 
0.9%
bangalore 528
 
0.5%
bihar 434
 
0.5%
uttar 253
 
0.3%
patna 165
 
0.2%
jharkhand 117
 
0.1%
Other values (2080) 4035
 
4.2%

Most occurring characters

ValueCountFrequency (%)
a 165808
21.4%
135754
17.5%
h 78227
10.1%
s 54168
 
7.0%
d 48222
 
6.2%
n 45659
 
5.9%
t 31377
 
4.0%
R 30099
 
3.9%
j 30035
 
3.9%
r 26242
 
3.4%
Other values (178) 130548
16.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 549687
70.8%
Space Separator 135754
 
17.5%
Uppercase Letter 81902
 
10.6%
Decimal Number 7436
 
1.0%
Other Punctuation 1102
 
0.1%
Other Letter 85
 
< 0.1%
Other Symbol 34
 
< 0.1%
Spacing Mark 31
 
< 0.1%
Dash Punctuation 27
 
< 0.1%
Nonspacing Mark 25
 
< 0.1%
Other values (7) 56
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 165808
30.2%
h 78227
14.2%
s 54168
 
9.9%
d 48222
 
8.8%
n 45659
 
8.3%
t 31377
 
5.7%
j 30035
 
5.5%
r 26242
 
4.8%
e 25043
 
4.6%
y 23540
 
4.3%
Other values (35) 21366
 
3.9%
Other Letter
ValueCountFrequency (%)
11
 
12.9%
6
 
7.1%
5
 
5.9%
4
 
4.7%
4
 
4.7%
4
 
4.7%
4
 
4.7%
3
 
3.5%
3
 
3.5%
3
 
3.5%
Other values (28) 38
44.7%
Uppercase Letter
ValueCountFrequency (%)
R 30099
36.8%
P 23806
29.1%
M 23509
28.7%
B 1212
 
1.5%
I 948
 
1.2%
U 295
 
0.4%
A 243
 
0.3%
K 208
 
0.3%
E 195
 
0.2%
J 188
 
0.2%
Other values (15) 1199
 
1.5%
Other Symbol
ValueCountFrequency (%)
😎 4
 
11.8%
🇦 3
 
8.8%
💓 2
 
5.9%
2
 
5.9%
🙏 2
 
5.9%
👆 2
 
5.9%
1
 
2.9%
🐅 1
 
2.9%
1
 
2.9%
👌 1
 
2.9%
Other values (15) 15
44.1%
Other Punctuation
ValueCountFrequency (%)
. 1037
94.1%
@ 10
 
0.9%
& 10
 
0.9%
* 10
 
0.9%
? 10
 
0.9%
; 8
 
0.7%
' 6
 
0.5%
# 5
 
0.5%
2
 
0.2%
: 2
 
0.2%
Spacing Mark
ValueCountFrequency (%)
13
41.9%
6
19.4%
2
 
6.5%
2
 
6.5%
2
 
6.5%
1
 
3.2%
1
 
3.2%
1
 
3.2%
1
 
3.2%
ि 1
 
3.2%
Decimal Number
ValueCountFrequency (%)
0 3111
41.8%
1 858
 
11.5%
2 782
 
10.5%
8 637
 
8.6%
5 547
 
7.4%
3 431
 
5.8%
4 408
 
5.5%
6 313
 
4.2%
7 228
 
3.1%
9 121
 
1.6%
Nonspacing Mark
ValueCountFrequency (%)
6
24.0%
ಿ 4
16.0%
4
16.0%
3
12.0%
3
12.0%
3
12.0%
1
 
4.0%
1
 
4.0%
Math Symbol
ValueCountFrequency (%)
+ 3
60.0%
1
 
20.0%
1
 
20.0%
Open Punctuation
ValueCountFrequency (%)
( 17
94.4%
{ 1
 
5.6%
Close Punctuation
ValueCountFrequency (%)
) 17
94.4%
} 1
 
5.6%
Connector Punctuation
ValueCountFrequency (%)
1
50.0%
_ 1
50.0%
Currency Symbol
ValueCountFrequency (%)
$ 1
50.0%
¢ 1
50.0%
Space Separator
ValueCountFrequency (%)
135754
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 27
100.0%
Format
ValueCountFrequency (%)
9
100.0%
Modifier Symbol
ValueCountFrequency (%)
^ 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 631575
81.4%
Common 144409
 
18.6%
Devanagari 103
 
< 0.1%
Kannada 35
 
< 0.1%
Cyrillic 8
 
< 0.1%
Greek 6
 
< 0.1%
Inherited 3
 
< 0.1%

Most frequent character per script

Common
ValueCountFrequency (%)
135754
94.0%
0 3111
 
2.2%
. 1037
 
0.7%
1 858
 
0.6%
2 782
 
0.5%
8 637
 
0.4%
5 547
 
0.4%
3 431
 
0.3%
4 408
 
0.3%
6 313
 
0.2%
Other values (51) 531
 
0.4%
Latin
ValueCountFrequency (%)
a 165808
26.3%
h 78227
12.4%
s 54168
 
8.6%
d 48222
 
7.6%
n 45659
 
7.2%
t 31377
 
5.0%
R 30099
 
4.8%
j 30035
 
4.8%
r 26242
 
4.2%
e 25043
 
4.0%
Other values (50) 96695
15.3%
Devanagari
ValueCountFrequency (%)
13
 
12.6%
11
 
10.7%
6
 
5.8%
6
 
5.8%
6
 
5.8%
5
 
4.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
Other values (25) 40
38.8%
Kannada
ValueCountFrequency (%)
ಿ 4
 
11.4%
4
 
11.4%
3
 
8.6%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
1
 
2.9%
Other values (11) 11
31.4%
Cyrillic
ValueCountFrequency (%)
и 2
25.0%
т 2
25.0%
у 1
12.5%
н 1
12.5%
э 1
12.5%
я 1
12.5%
Greek
ValueCountFrequency (%)
α 3
50.0%
ν 1
 
16.7%
ι 1
 
16.7%
υ 1
 
16.7%
Inherited
ValueCountFrequency (%)
3
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 775867
> 99.9%
Devanagari 103
 
< 0.1%
None 82
 
< 0.1%
Kannada 35
 
< 0.1%
Punctuation 12
 
< 0.1%
Enclosed Alphanum Sup 9
 
< 0.1%
Cyrillic 8
 
< 0.1%
Emoticons 7
 
< 0.1%
VS 3
 
< 0.1%
Misc Symbols 3
 
< 0.1%
Other values (5) 10
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 165808
21.4%
135754
17.5%
h 78227
10.1%
s 54168
 
7.0%
d 48222
 
6.2%
n 45659
 
5.9%
t 31377
 
4.0%
R 30099
 
3.9%
j 30035
 
3.9%
r 26242
 
3.4%
Other values (71) 130276
16.8%
None
ValueCountFrequency (%)
ā 60
73.2%
α 3
 
3.7%
💓 2
 
2.4%
👆 2
 
2.4%
ν 1
 
1.2%
ι 1
 
1.2%
🐅 1
 
1.2%
đ 1
 
1.2%
υ 1
 
1.2%
👌 1
 
1.2%
Other values (9) 9
 
11.0%
Devanagari
ValueCountFrequency (%)
13
 
12.6%
11
 
10.7%
6
 
5.8%
6
 
5.8%
6
 
5.8%
5
 
4.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
4
 
3.9%
Other values (25) 40
38.8%
Punctuation
ValueCountFrequency (%)
9
75.0%
2
 
16.7%
1
 
8.3%
Kannada
ValueCountFrequency (%)
ಿ 4
 
11.4%
4
 
11.4%
3
 
8.6%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
2
 
5.7%
1
 
2.9%
Other values (11) 11
31.4%
Emoticons
ValueCountFrequency (%)
😎 4
57.1%
🙏 2
28.6%
🙅 1
 
14.3%
VS
ValueCountFrequency (%)
3
100.0%
Enclosed Alphanum Sup
ValueCountFrequency (%)
🇦 3
33.3%
🇱 1
 
11.1%
🇮 1
 
11.1%
🇭 1
 
11.1%
🇸 1
 
11.1%
🇳 1
 
11.1%
🇲 1
 
11.1%
Cyrillic
ValueCountFrequency (%)
и 2
25.0%
т 2
25.0%
у 1
12.5%
н 1
12.5%
э 1
12.5%
я 1
12.5%
Misc Symbols
ValueCountFrequency (%)
2
66.7%
1
33.3%
Math Operators
ValueCountFrequency (%)
1
100.0%
Phonetic Ext
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Dingbats
ValueCountFrequency (%)
1
50.0%
1
50.0%
IPA Ext
ValueCountFrequency (%)
ʟ 1
33.3%
ʜ 1
33.3%
ʀ 1
33.3%
Arrows
ValueCountFrequency (%)
1
100.0%

CompanyName
Text

MISSING 

Distinct 1904
Distinct (%) 42.3%
Missing 400061
Missing (%) 98.9%
Memory size 3.1 MiB

Length

Max length 45
Median length 42
Mean length 8.3396226
Min length 1

Characters and Unicode

Total characters 37570
Distinct characters 214
Distinct categories 16
Distinct scripts 8
Distinct blocks 14
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1652
Unique (%) 36.7%

Sample

1st row Bilder
2nd row An Ideal coaching
3rd row Electrosteel Steel L T D
4th row Best PU College
5th row Noida
ValueCountFrequency (%)
india 990
 
16.7%
in 583
 
9.9%
pradesh 137
 
2.3%
bangalore 108
 
1.8%
uttar 96
 
1.6%
patna 91
 
1.5%
karnataka 82
 
1.4%
jharkhand 74
 
1.3%
bihar 68
 
1.1%
rajasthan 55
 
0.9%
Other values (2048) 3631
61.4%

Most occurring characters

ValueCountFrequency (%)
a 5028
 
13.4%
4405
 
11.7%
n 3148
 
8.4%
i 2956
 
7.9%
r 2013
 
5.4%
d 1775
 
4.7%
e 1537
 
4.1%
t 1344
 
3.6%
h 1225
 
3.3%
I 1172
 
3.1%
Other values (204) 12967
34.5%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 26132
69.6%
Uppercase Letter 5942
 
15.8%
Space Separator 4405
 
11.7%
Decimal Number 571
 
1.5%
Other Punctuation 261
 
0.7%
Other Letter 122
 
0.3%
Spacing Mark 32
 
0.1%
Other Symbol 29
 
0.1%
Nonspacing Mark 27
 
0.1%
Dash Punctuation 14
 
< 0.1%
Other values (6) 35
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
8
 
6.6%
7
 
5.7%
6
 
4.9%
6
 
4.9%
5
 
4.1%
5
 
4.1%
4
 
3.3%
4
 
3.3%
3
 
2.5%
3
 
2.5%
Other values (45) 71
58.2%
Lowercase Letter
ValueCountFrequency (%)
a 5028
19.2%
n 3148
12.0%
i 2956
11.3%
r 2013
 
7.7%
d 1775
 
6.8%
e 1537
 
5.9%
t 1344
 
5.1%
h 1225
 
4.7%
o 1057
 
4.0%
s 950
 
3.6%
Other values (44) 5099
19.5%
Uppercase Letter
ValueCountFrequency (%)
I 1172
19.7%
B 516
 
8.7%
P 434
 
7.3%
S 414
 
7.0%
A 363
 
6.1%
R 303
 
5.1%
M 298
 
5.0%
C 227
 
3.8%
K 215
 
3.6%
T 210
 
3.5%
Other values (20) 1790
30.1%
Other Symbol
ValueCountFrequency (%)
😎 6
20.7%
😘 2
 
6.9%
2
 
6.9%
🇨 2
 
6.9%
🇮 2
 
6.9%
🕍 1
 
3.4%
😂 1
 
3.4%
🤝 1
 
3.4%
🏤 1
 
3.4%
👈 1
 
3.4%
Other values (10) 10
34.5%
Other Punctuation
ValueCountFrequency (%)
. 207
79.3%
& 23
 
8.8%
! 11
 
4.2%
' 7
 
2.7%
@ 6
 
2.3%
2
 
0.8%
/ 1
 
0.4%
; 1
 
0.4%
? 1
 
0.4%
§ 1
 
0.4%
Decimal Number
ValueCountFrequency (%)
0 236
41.3%
1 78
 
13.7%
2 59
 
10.3%
5 40
 
7.0%
8 38
 
6.7%
7 27
 
4.7%
6 26
 
4.6%
3 25
 
4.4%
4 23
 
4.0%
9 19
 
3.3%
Spacing Mark
ValueCountFrequency (%)
11
34.4%
9
28.1%
3
 
9.4%
2
 
6.2%
2
 
6.2%
1
 
3.1%
1
 
3.1%
1
 
3.1%
1
 
3.1%
ि 1
 
3.1%
Nonspacing Mark
ValueCountFrequency (%)
6
22.2%
6
22.2%
5
18.5%
3
11.1%
2
 
7.4%
2
 
7.4%
1
 
3.7%
1
 
3.7%
1
 
3.7%
Currency Symbol
ValueCountFrequency (%)
¤ 2
33.3%
£ 1
16.7%
$ 1
16.7%
¥ 1
16.7%
¢ 1
16.7%
Math Symbol
ValueCountFrequency (%)
| 1
25.0%
1
25.0%
÷ 1
25.0%
+ 1
25.0%
Space Separator
ValueCountFrequency (%)
4405
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 11
100.0%
Close Punctuation
ValueCountFrequency (%)
) 11
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 32058
85.3%
Common 5315
 
14.1%
Devanagari 129
 
0.3%
Kannada 36
 
0.1%
Arabic 15
 
< 0.1%
Greek 12
 
< 0.1%
Cyrillic 4
 
< 0.1%
Inherited 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 5028
15.7%
n 3148
 
9.8%
i 2956
 
9.2%
r 2013
 
6.3%
d 1775
 
5.5%
e 1537
 
4.8%
t 1344
 
4.2%
h 1225
 
3.8%
I 1172
 
3.7%
o 1057
 
3.3%
Other values (65) 10803
33.7%
Common
ValueCountFrequency (%)
4405
82.9%
0 236
 
4.4%
. 207
 
3.9%
1 78
 
1.5%
2 59
 
1.1%
5 40
 
0.8%
8 38
 
0.7%
7 27
 
0.5%
6 26
 
0.5%
3 25
 
0.5%
Other values (46) 174
 
3.3%
Devanagari
ValueCountFrequency (%)
11
 
8.5%
9
 
7.0%
8
 
6.2%
7
 
5.4%
6
 
4.7%
6
 
4.7%
6
 
4.7%
6
 
4.7%
5
 
3.9%
5
 
3.9%
Other values (30) 60
46.5%
Kannada
ValueCountFrequency (%)
5
13.9%
3
 
8.3%
3
 
8.3%
3
 
8.3%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
1
 
2.8%
Other values (11) 11
30.6%
Arabic
ValueCountFrequency (%)
ا 3
20.0%
ر 2
13.3%
ی 1
 
6.7%
ش 1
 
6.7%
ہ 1
 
6.7%
ب 1
 
6.7%
ن 1
 
6.7%
ت 1
 
6.7%
س 1
 
6.7%
ک 1
 
6.7%
Other values (2) 2
13.3%
Greek
ValueCountFrequency (%)
α 4
33.3%
ρ 2
16.7%
Π 2
16.7%
ν 1
 
8.3%
υ 1
 
8.3%
β 1
 
8.3%
ι 1
 
8.3%
Cyrillic
ValueCountFrequency (%)
т 2
50.0%
и 2
50.0%
Inherited
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 37309
99.3%
Devanagari 129
 
0.3%
None 44
 
0.1%
Kannada 36
 
0.1%
Emoticons 15
 
< 0.1%
Arabic 15
 
< 0.1%
Cyrillic 4
 
< 0.1%
Enclosed Alphanum Sup 4
 
< 0.1%
Phonetic Ext 4
 
< 0.1%
IPA Ext 3
 
< 0.1%
Other values (4) 7
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 5028
 
13.5%
4405
 
11.8%
n 3148
 
8.4%
i 2956
 
7.9%
r 2013
 
5.4%
d 1775
 
4.8%
e 1537
 
4.1%
t 1344
 
3.6%
h 1225
 
3.3%
I 1172
 
3.1%
Other values (69) 12706
34.1%
Devanagari
ValueCountFrequency (%)
11
 
8.5%
9
 
7.0%
8
 
6.2%
7
 
5.4%
6
 
4.7%
6
 
4.7%
6
 
4.7%
6
 
4.7%
5
 
3.9%
5
 
3.9%
Other values (30) 60
46.5%
Emoticons
ValueCountFrequency (%)
😎 6
40.0%
😘 2
 
13.3%
😂 1
 
6.7%
😋 1
 
6.7%
😍 1
 
6.7%
😊 1
 
6.7%
🙅 1
 
6.7%
😉 1
 
6.7%
😆 1
 
6.7%
Kannada
ValueCountFrequency (%)
5
13.9%
3
 
8.3%
3
 
8.3%
3
 
8.3%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
2
 
5.6%
1
 
2.8%
Other values (11) 11
30.6%
None
ValueCountFrequency (%)
α 4
 
9.1%
ρ 2
 
4.5%
Π 2
 
4.5%
¤ 2
 
4.5%
🕍 1
 
2.3%
Á 1
 
2.3%
🤝 1
 
2.3%
ß 1
 
2.3%
ł 1
 
2.3%
🏤 1
 
2.3%
Other values (28) 28
63.6%
Arabic
ValueCountFrequency (%)
ا 3
20.0%
ر 2
13.3%
ی 1
 
6.7%
ش 1
 
6.7%
ہ 1
 
6.7%
ب 1
 
6.7%
ن 1
 
6.7%
ت 1
 
6.7%
س 1
 
6.7%
ک 1
 
6.7%
Other values (2) 2
13.3%
Cyrillic
ValueCountFrequency (%)
т 2
50.0%
и 2
50.0%
IPA Ext
ValueCountFrequency (%)
ɦ 2
66.7%
ɨ 1
33.3%
Dingbats
ValueCountFrequency (%)
2
100.0%
Punctuation
ValueCountFrequency (%)
2
66.7%
1
33.3%
Enclosed Alphanum Sup
ValueCountFrequency (%)
🇨 2
50.0%
🇮 2
50.0%
Phonetic Ext
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%
Arrows
ValueCountFrequency (%)
1
100.0%
VS
ValueCountFrequency (%)
1
100.0%

Email
Text

MISSING 

Distinct 45941
Distinct (%) 95.2%
Missing 356314
Missing (%) 88.1%
Memory size 3.1 MiB

Length

Max length 44
Median length 38
Mean length 22.383362
Min length 1

Characters and Unicode

Total characters 1080042
Distinct characters 126
Distinct categories 15
Distinct scripts 5
Distinct blocks 7
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 45843
Unique (%) 95.0%

Sample

1st rowrajata860@gmail.com
2nd rownaveenkuta3101@gmail.com
3rd rowparshuramk1970@gmail.com
4th rowkss@gmail.com
5th rowmdsameershaikh75@gmail.com
ValueCountFrequency (%)
in 2033
 
4.2%
bihār 59
 
0.1%
jharkhand 38
 
0.1%
karnataka 22
 
< 0.1%
india 20
 
< 0.1%
pradesh 18
 
< 0.1%
uttar 12
 
< 0.1%
student 11
 
< 0.1%
bangalore 11
 
< 0.1%
bihar 9
 
< 0.1%
Other values (46078) 46344
95.4%

Most occurring characters

ValueCountFrequency (%)
a 133986
 
12.4%
m 113719
 
10.5%
i 79537
 
7.4%
o 56829
 
5.3%
l 55976
 
5.2%
g 53110
 
4.9%
. 51228
 
4.7%
c 50167
 
4.6%
@ 45534
 
4.2%
r 38784
 
3.6%
Other values (116) 401172
37.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 839478
77.7%
Decimal Number 137410
 
12.7%
Other Punctuation 96773
 
9.0%
Uppercase Letter 3516
 
0.3%
Space Separator 2628
 
0.2%
Connector Punctuation 167
 
< 0.1%
Other Letter 32
 
< 0.1%
Dash Punctuation 17
 
< 0.1%
Spacing Mark 7
 
< 0.1%
Nonspacing Mark 7
 
< 0.1%
Other values (5) 7
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 133986
16.0%
m 113719
13.5%
i 79537
9.5%
o 56829
 
6.8%
l 55976
 
6.7%
g 53110
 
6.3%
c 50167
 
6.0%
r 38784
 
4.6%
h 34446
 
4.1%
n 33886
 
4.0%
Other values (27) 189038
22.5%
Uppercase Letter
ValueCountFrequency (%)
S 449
12.8%
A 417
11.9%
R 297
 
8.4%
M 263
 
7.5%
B 210
 
6.0%
G 201
 
5.7%
K 193
 
5.5%
P 173
 
4.9%
N 143
 
4.1%
D 136
 
3.9%
Other values (21) 1034
29.4%
Other Letter
ValueCountFrequency (%)
3
 
9.4%
گ 3
 
9.4%
2
 
6.2%
2
 
6.2%
ا 2
 
6.2%
د 1
 
3.1%
ے 1
 
3.1%
ح 1
 
3.1%
1
 
3.1%
1
 
3.1%
Other values (15) 15
46.9%
Decimal Number
ValueCountFrequency (%)
0 18563
13.5%
1 18122
13.2%
9 16304
11.9%
2 15016
10.9%
7 12584
9.2%
8 11875
8.6%
3 11454
8.3%
6 11305
8.2%
4 11252
8.2%
5 10935
8.0%
Other Punctuation
ValueCountFrequency (%)
. 51228
52.9%
@ 45534
47.1%
& 5
 
< 0.1%
# 2
 
< 0.1%
? 2
 
< 0.1%
' 1
 
< 0.1%
/ 1
 
< 0.1%
Nonspacing Mark
ValueCountFrequency (%)
2
28.6%
2
28.6%
1
14.3%
1
14.3%
1
14.3%
Spacing Mark
ValueCountFrequency (%)
5
71.4%
1
 
14.3%
ि 1
 
14.3%
Space Separator
ValueCountFrequency (%)
2628
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 167
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 17
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Other Symbol
ValueCountFrequency (%)
👊 1
100.0%
Currency Symbol
ValueCountFrequency (%)
1
100.0%
Math Symbol
ValueCountFrequency (%)
~ 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 842993
78.1%
Common 237003
 
21.9%
Devanagari 33
 
< 0.1%
Arabic 8
 
< 0.1%
Kannada 5
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 133986
15.9%
m 113719
13.5%
i 79537
9.4%
o 56829
 
6.7%
l 55976
 
6.6%
g 53110
 
6.3%
c 50167
 
6.0%
r 38784
 
4.6%
h 34446
 
4.1%
n 33886
 
4.0%
Other values (57) 192553
22.8%
Common
ValueCountFrequency (%)
. 51228
21.6%
@ 45534
19.2%
0 18563
 
7.8%
1 18122
 
7.6%
9 16304
 
6.9%
2 15016
 
6.3%
7 12584
 
5.3%
8 11875
 
5.0%
3 11454
 
4.8%
6 11305
 
4.8%
Other values (16) 25018
10.6%
Devanagari
ValueCountFrequency (%)
5
 
15.2%
3
 
9.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Other values (13) 13
39.4%
Arabic
ValueCountFrequency (%)
گ 3
37.5%
ا 2
25.0%
د 1
 
12.5%
ے 1
 
12.5%
ح 1
 
12.5%
Kannada
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 1079920
> 99.9%
None 70
 
< 0.1%
Devanagari 33
 
< 0.1%
Arabic 8
 
< 0.1%
IPA Ext 5
 
< 0.1%
Kannada 5
 
< 0.1%
Currency Symbols 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 133986
 
12.4%
m 113719
 
10.5%
i 79537
 
7.4%
o 56829
 
5.3%
l 55976
 
5.2%
g 53110
 
4.9%
. 51228
 
4.7%
c 50167
 
4.6%
@ 45534
 
4.2%
r 38784
 
3.6%
Other values (65) 401050
37.1%
None
ValueCountFrequency (%)
ā 59
84.3%
µ 1
 
1.4%
ǿ 1
 
1.4%
👊 1
 
1.4%
Ť 1
 
1.4%
Č 1
 
1.4%
Ř 1
 
1.4%
ű 1
 
1.4%
ã 1
 
1.4%
Þ 1
 
1.4%
Other values (2) 2
 
2.9%
Devanagari
ValueCountFrequency (%)
5
 
15.2%
3
 
9.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
2
 
6.1%
1
 
3.0%
1
 
3.0%
1
 
3.0%
1
 
3.0%
Other values (13) 13
39.4%
Arabic
ValueCountFrequency (%)
گ 3
37.5%
ا 2
25.0%
د 1
 
12.5%
ے 1
 
12.5%
ح 1
 
12.5%
IPA Ext
ValueCountFrequency (%)
ʍ 1
20.0%
ɲ 1
20.0%
ʀ 1
20.0%
ɘ 1
20.0%
ɭ 1
20.0%
Kannada
ValueCountFrequency (%)
1
20.0%
1
20.0%
1
20.0%
1
20.0%
1
20.0%
Currency Symbols
ValueCountFrequency (%)
1
100.0%

Facebook
Text

MISSING 

Distinct 9686
Distinct (%) 97.4%
Missing 394619
Missing (%) 97.5%
Memory size 3.1 MiB

Length

Max length 118
Median length 48
Mean length 20.768976
Min length 1

Characters and Unicode

Total characters 206589
Distinct characters 143
Distinct categories 15
Distinct scripts 7
Distinct blocks 9
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 9655
Unique (%) 97.1%

Sample

1st rowgangarathnakar@gmail.com
2nd row1376066145877140.00
3rd row934211050099289.00
4th row in
5th row875620199279832.00
ValueCountFrequency (%)
india 158
 
1.5%
in 67
 
0.7%
student 13
 
0.1%
9
 
0.1%
bihar 8
 
0.1%
director 6
 
0.1%
company 6
 
0.1%
bangalore 5
 
< 0.1%
business 5
 
< 0.1%
ltd 5
 
< 0.1%
Other values (9847) 9979
97.3%

Most occurring characters

ValueCountFrequency (%)
a 17973
 
8.7%
0 16218
 
7.9%
m 15260
 
7.4%
i 10685
 
5.2%
. 9940
 
4.8%
1 8457
 
4.1%
o 7612
 
3.7%
l 7432
 
3.6%
2 7361
 
3.6%
g 7008
 
3.4%
Other values (133) 98643
47.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 113361
54.9%
Decimal Number 75363
36.5%
Other Punctuation 15967
 
7.7%
Uppercase Letter 1182
 
0.6%
Space Separator 571
 
0.3%
Other Letter 58
 
< 0.1%
Spacing Mark 26
 
< 0.1%
Connector Punctuation 22
 
< 0.1%
Nonspacing Mark 16
 
< 0.1%
Other Symbol 8
 
< 0.1%
Other values (5) 15
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 17973
15.9%
m 15260
13.5%
i 10685
9.4%
o 7612
 
6.7%
l 7432
 
6.6%
g 7008
 
6.2%
c 6687
 
5.9%
r 5519
 
4.9%
n 4587
 
4.0%
h 4496
 
4.0%
Other values (32) 26102
23.0%
Other Letter
ValueCountFrequency (%)
5
 
8.6%
5
 
8.6%
4
 
6.9%
4
 
6.9%
3
 
5.2%
3
 
5.2%
3
 
5.2%
2
 
3.4%
2
 
3.4%
2
 
3.4%
Other values (21) 25
43.1%
Uppercase Letter
ValueCountFrequency (%)
I 195
16.5%
A 118
 
10.0%
S 112
 
9.5%
R 72
 
6.1%
B 68
 
5.8%
M 56
 
4.7%
C 54
 
4.6%
T 53
 
4.5%
N 51
 
4.3%
E 50
 
4.2%
Other values (16) 353
29.9%
Decimal Number
ValueCountFrequency (%)
0 16218
21.5%
1 8457
11.2%
2 7361
9.8%
9 6441
 
8.5%
7 6376
 
8.5%
4 6349
 
8.4%
3 6304
 
8.4%
8 6005
 
8.0%
6 5930
 
7.9%
5 5922
 
7.9%
Other Punctuation
ValueCountFrequency (%)
. 9940
62.3%
@ 6007
37.6%
& 7
 
< 0.1%
/ 6
 
< 0.1%
' 3
 
< 0.1%
# 2
 
< 0.1%
: 1
 
< 0.1%
\ 1
 
< 0.1%
Spacing Mark
ValueCountFrequency (%)
14
53.8%
6
23.1%
ि 2
 
7.7%
1
 
3.8%
1
 
3.8%
1
 
3.8%
1
 
3.8%
Nonspacing Mark
ValueCountFrequency (%)
5
31.2%
4
25.0%
2
 
12.5%
2
 
12.5%
1
 
6.2%
ಿ 1
 
6.2%
1
 
6.2%
Other Symbol
ValueCountFrequency (%)
4
50.0%
💪 1
 
12.5%
😎 1
 
12.5%
🤜 1
 
12.5%
🤛 1
 
12.5%
Space Separator
ValueCountFrequency (%)
571
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 22
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Currency Symbol
ValueCountFrequency (%)
$ 1
100.0%
Math Symbol
ValueCountFrequency (%)
| 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 114529
55.4%
Common 91946
44.5%
Devanagari 87
 
< 0.1%
Kannada 9
 
< 0.1%
Greek 8
 
< 0.1%
Cyrillic 6
 
< 0.1%
Inherited 4
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 17973
15.7%
m 15260
13.3%
i 10685
 
9.3%
o 7612
 
6.6%
l 7432
 
6.5%
g 7008
 
6.1%
c 6687
 
5.8%
r 5519
 
4.8%
n 4587
 
4.0%
h 4496
 
3.9%
Other values (48) 27270
23.8%
Devanagari
ValueCountFrequency (%)
14
 
16.1%
6
 
6.9%
5
 
5.7%
5
 
5.7%
5
 
5.7%
4
 
4.6%
4
 
4.6%
3
 
3.4%
3
 
3.4%
3
 
3.4%
Other values (25) 35
40.2%
Common
ValueCountFrequency (%)
0 16218
17.6%
. 9940
10.8%
1 8457
9.2%
2 7361
8.0%
9 6441
 
7.0%
7 6376
 
6.9%
4 6349
 
6.9%
3 6304
 
6.9%
@ 6007
 
6.5%
8 6005
 
6.5%
Other values (20) 12488
13.6%
Kannada
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
ಿ 1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Cyrillic
ValueCountFrequency (%)
и 2
33.3%
к 1
16.7%
ѕ 1
16.7%
т 1
16.7%
я 1
16.7%
Greek
ValueCountFrequency (%)
σ 2
25.0%
ρ 2
25.0%
α 2
25.0%
ι 1
12.5%
υ 1
12.5%
Inherited
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 206460
99.9%
Devanagari 87
 
< 0.1%
None 15
 
< 0.1%
Kannada 9
 
< 0.1%
Cyrillic 6
 
< 0.1%
VS 4
 
< 0.1%
Dingbats 4
 
< 0.1%
IPA Ext 3
 
< 0.1%
Emoticons 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 17973
 
8.7%
0 16218
 
7.9%
m 15260
 
7.4%
i 10685
 
5.2%
. 9940
 
4.8%
1 8457
 
4.1%
o 7612
 
3.7%
l 7432
 
3.6%
2 7361
 
3.6%
g 7008
 
3.4%
Other values (67) 98514
47.7%
Devanagari
ValueCountFrequency (%)
14
 
16.1%
6
 
6.9%
5
 
5.7%
5
 
5.7%
5
 
5.7%
4
 
4.6%
4
 
4.6%
3
 
3.4%
3
 
3.4%
3
 
3.4%
Other values (25) 35
40.2%
VS
ValueCountFrequency (%)
4
100.0%
Dingbats
ValueCountFrequency (%)
4
100.0%
Cyrillic
ValueCountFrequency (%)
и 2
33.3%
к 1
16.7%
ѕ 1
16.7%
т 1
16.7%
я 1
16.7%
None
ValueCountFrequency (%)
ø 2
13.3%
σ 2
13.3%
ρ 2
13.3%
α 2
13.3%
💪 1
6.7%
🤜 1
6.7%
🤛 1
6.7%
ι 1
6.7%
υ 1
6.7%
ë 1
6.7%
Kannada
ValueCountFrequency (%)
1
11.1%
1
11.1%
1
11.1%
ಿ 1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
1
11.1%
Emoticons
ValueCountFrequency (%)
😎 1
100.0%
IPA Ext
ValueCountFrequency (%)
ɩ 1
33.3%
ɭ 1
33.3%
ɘ 1
33.3%

Twitter
Text

MISSING 

Distinct 1861
Distinct (%) 89.9%
Missing 402495
Missing (%) 99.5%
Memory size 3.1 MiB

Length

Max length 96
Median length 46
Mean length 16.588605
Min length 1

Characters and Unicode

Total characters 34355
Distinct characters 134
Distinct categories 14
Distinct scripts 6
Distinct blocks 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1845
Unique (%) 89.1%

Sample

1st rowमहिसौर चाँदसराय जन्दाहा हज़रत वैशाली हाजीपुर बिहार नोएडा युवा चन्दन यादव नेता जी समाजवादी पार्टी
2nd rowBusinessmen
3rd row100001902805040.00
4th rowgujjarsanketh@gmail.com
5th row911136412318375.00
ValueCountFrequency (%)
in 182
 
7.4%
ltd 13
 
0.5%
student 12
 
0.5%
pvt 11
 
0.4%
10
 
0.4%
company 10
 
0.4%
india 8
 
0.3%
limited 7
 
0.3%
enterprises 7
 
0.3%
bank 6
 
0.2%
Other values (2062) 2192
89.2%

Most occurring characters

ValueCountFrequency (%)
0 4691
 
13.7%
1 1998
 
5.8%
2 1774
 
5.2%
a 1622
 
4.7%
7 1562
 
4.5%
. 1557
 
4.5%
6 1542
 
4.5%
8 1531
 
4.5%
4 1530
 
4.5%
3 1530
 
4.5%
Other values (124) 15018
43.7%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 19070
55.5%
Lowercase Letter 11482
33.4%
Other Punctuation 1985
 
5.8%
Uppercase Letter 1027
 
3.0%
Space Separator 590
 
1.7%
Other Letter 107
 
0.3%
Spacing Mark 43
 
0.1%
Nonspacing Mark 25
 
0.1%
Other Symbol 6
 
< 0.1%
Connector Punctuation 5
 
< 0.1%
Other values (4) 15
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
10
 
9.3%
9
 
8.4%
8
 
7.5%
7
 
6.5%
7
 
6.5%
6
 
5.6%
6
 
5.6%
5
 
4.7%
5
 
4.7%
5
 
4.7%
Other values (23) 39
36.4%
Lowercase Letter
ValueCountFrequency (%)
a 1622
14.1%
i 1200
 
10.5%
m 1130
 
9.8%
n 786
 
6.8%
o 773
 
6.7%
l 679
 
5.9%
r 615
 
5.4%
c 572
 
5.0%
g 556
 
4.8%
e 514
 
4.5%
Other values (22) 3035
26.4%
Uppercase Letter
ValueCountFrequency (%)
S 120
 
11.7%
A 92
 
9.0%
C 73
 
7.1%
T 64
 
6.2%
B 62
 
6.0%
I 57
 
5.6%
E 53
 
5.2%
D 51
 
5.0%
R 51
 
5.0%
P 51
 
5.0%
Other values (16) 353
34.4%
Decimal Number
ValueCountFrequency (%)
0 4691
24.6%
1 1998
10.5%
2 1774
 
9.3%
7 1562
 
8.2%
6 1542
 
8.1%
8 1531
 
8.0%
4 1530
 
8.0%
3 1530
 
8.0%
5 1463
 
7.7%
9 1449
 
7.6%
Nonspacing Mark
ValueCountFrequency (%)
9
36.0%
5
20.0%
4
16.0%
2
 
8.0%
2
 
8.0%
1
 
4.0%
1
 
4.0%
1
 
4.0%
Other Punctuation
ValueCountFrequency (%)
. 1557
78.4%
@ 414
 
20.9%
& 6
 
0.3%
' 3
 
0.2%
# 3
 
0.2%
; 1
 
0.1%
/ 1
 
0.1%
Spacing Mark
ValueCountFrequency (%)
24
55.8%
ि 8
 
18.6%
7
 
16.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
1
 
2.3%
Other Symbol
ValueCountFrequency (%)
💝 3
50.0%
🏣 1
 
16.7%
🏤 1
 
16.7%
😉 1
 
16.7%
Math Symbol
ValueCountFrequency (%)
| 3
75.0%
× 1
 
25.0%
Space Separator
ValueCountFrequency (%)
590
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 5
100.0%
Close Punctuation
ValueCountFrequency (%)
) 4
100.0%
Open Punctuation
ValueCountFrequency (%)
( 4
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 21671
63.1%
Latin 12504
36.4%
Devanagari 162
 
0.5%
Kannada 13
 
< 0.1%
Cyrillic 4
 
< 0.1%
Greek 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 1622
 
13.0%
i 1200
 
9.6%
m 1130
 
9.0%
n 786
 
6.3%
o 773
 
6.2%
l 679
 
5.4%
r 615
 
4.9%
c 572
 
4.6%
g 556
 
4.4%
e 514
 
4.1%
Other values (43) 4057
32.4%
Devanagari
ValueCountFrequency (%)
24
 
14.8%
10
 
6.2%
9
 
5.6%
9
 
5.6%
ि 8
 
4.9%
8
 
4.9%
7
 
4.3%
7
 
4.3%
7
 
4.3%
6
 
3.7%
Other values (27) 67
41.4%
Common
ValueCountFrequency (%)
0 4691
21.6%
1 1998
9.2%
2 1774
 
8.2%
7 1562
 
7.2%
. 1557
 
7.2%
6 1542
 
7.1%
8 1531
 
7.1%
4 1530
 
7.1%
3 1530
 
7.1%
5 1463
 
6.8%
Other values (18) 2493
11.5%
Kannada
ValueCountFrequency (%)
2
15.4%
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
Cyrillic
ValueCountFrequency (%)
в 1
25.0%
к 1
25.0%
є 1
25.0%
я 1
25.0%
Greek
ValueCountFrequency (%)
ι 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 34167
99.5%
Devanagari 162
 
0.5%
Kannada 13
 
< 0.1%
None 8
 
< 0.1%
Cyrillic 4
 
< 0.1%
Emoticons 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 4691
 
13.7%
1 1998
 
5.8%
2 1774
 
5.2%
a 1622
 
4.7%
7 1562
 
4.6%
. 1557
 
4.6%
6 1542
 
4.5%
8 1531
 
4.5%
4 1530
 
4.5%
3 1530
 
4.5%
Other values (65) 14830
43.4%
Devanagari
ValueCountFrequency (%)
24
 
14.8%
10
 
6.2%
9
 
5.6%
9
 
5.6%
ि 8
 
4.9%
8
 
4.9%
7
 
4.3%
7
 
4.3%
7
 
4.3%
6
 
3.7%
Other values (27) 67
41.4%
None
ValueCountFrequency (%)
💝 3
37.5%
🏣 1
 
12.5%
🏤 1
 
12.5%
× 1
 
12.5%
ι 1
 
12.5%
ē 1
 
12.5%
Kannada
ValueCountFrequency (%)
2
15.4%
2
15.4%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
1
7.7%
Emoticons
ValueCountFrequency (%)
😉 1
100.0%
Cyrillic
ValueCountFrequency (%)
в 1
25.0%
к 1
25.0%
є 1
25.0%
я 1
25.0%

Unnamed: 10
Text

MISSING 

Distinct 1787
Distinct (%) 99.4%
Missing 402768
Missing (%) 99.6%
Memory size 3.1 MiB

Length

Max length 43
Median length 33
Mean length 22.467186
Min length 1

Characters and Unicode

Total characters 40396
Distinct characters 91
Distinct categories 11
Distinct scripts 4
Distinct blocks 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1785
Unique (%) 99.3%

Sample

1st rowychandan369@gmail.com
2nd rowJAIN ENTERPRISES
3rd rowgangulynimesh@gmail.com
4th rowkolagunda.prasad@gmail.com
5th rowmrforever513@gmail.com
ValueCountFrequency (%)
in 12
 
0.6%
4
 
0.2%
enterprises 3
 
0.2%
pradesh 2
 
0.1%
madhya 2
 
0.1%
ahmed6587@gmail.com 2
 
0.1%
india 2
 
0.1%
sumanth261094@gmail.com 1
 
0.1%
ranjithnaidu25@gmail.com 1
 
0.1%
kumarabhijeetabhi@gmail.com 1
 
0.1%
Other values (1828) 1828
98.4%

Most occurring characters

ValueCountFrequency (%)
a 4704
 
11.6%
m 3901
 
9.7%
i 2745
 
6.8%
o 2152
 
5.3%
. 2004
 
5.0%
l 2002
 
5.0%
g 1848
 
4.6%
c 1815
 
4.5%
@ 1609
 
4.0%
r 1390
 
3.4%
Other values (81) 16226
40.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 30179
74.7%
Decimal Number 6204
 
15.4%
Other Punctuation 3619
 
9.0%
Uppercase Letter 258
 
0.6%
Space Separator 87
 
0.2%
Connector Punctuation 18
 
< 0.1%
Other Letter 13
 
< 0.1%
Spacing Mark 7
 
< 0.1%
Other Symbol 5
 
< 0.1%
Dash Punctuation 3
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 4704
15.6%
m 3901
12.9%
i 2745
 
9.1%
o 2152
 
7.1%
l 2002
 
6.6%
g 1848
 
6.1%
c 1815
 
6.0%
r 1390
 
4.6%
n 1256
 
4.2%
h 1248
 
4.1%
Other values (16) 7118
23.6%
Uppercase Letter
ValueCountFrequency (%)
S 23
 
8.9%
A 22
 
8.5%
B 20
 
7.8%
P 19
 
7.4%
R 19
 
7.4%
I 17
 
6.6%
G 16
 
6.2%
M 15
 
5.8%
T 13
 
5.0%
C 12
 
4.7%
Other values (13) 82
31.8%
Other Letter
ValueCountFrequency (%)
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
1
 
7.7%
Other values (3) 3
23.1%
Decimal Number
ValueCountFrequency (%)
0 1066
17.2%
1 750
12.1%
2 640
10.3%
9 618
10.0%
7 576
9.3%
8 540
8.7%
6 539
8.7%
4 526
8.5%
5 478
7.7%
3 471
7.6%
Other Punctuation
ValueCountFrequency (%)
. 2004
55.4%
@ 1609
44.5%
& 4
 
0.1%
' 1
 
< 0.1%
* 1
 
< 0.1%
Other Symbol
ValueCountFrequency (%)
1
20.0%
🤗 1
20.0%
😃 1
20.0%
😄 1
20.0%
🤣 1
20.0%
Spacing Mark
ValueCountFrequency (%)
3
42.9%
3
42.9%
ि 1
 
14.3%
Nonspacing Mark
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Space Separator
ValueCountFrequency (%)
87
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 18
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 30437
75.3%
Common 9936
 
24.6%
Devanagari 22
 
0.1%
Inherited 1
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 4704
15.5%
m 3901
12.8%
i 2745
 
9.0%
o 2152
 
7.1%
l 2002
 
6.6%
g 1848
 
6.1%
c 1815
 
6.0%
r 1390
 
4.6%
n 1256
 
4.1%
h 1248
 
4.1%
Other values (39) 7376
24.2%
Common
ValueCountFrequency (%)
. 2004
20.2%
@ 1609
16.2%
0 1066
10.7%
1 750
 
7.5%
2 640
 
6.4%
9 618
 
6.2%
7 576
 
5.8%
8 540
 
5.4%
6 539
 
5.4%
4 526
 
5.3%
Other values (13) 1068
10.7%
Devanagari
ValueCountFrequency (%)
3
 
13.6%
3
 
13.6%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (8) 8
36.4%
Inherited
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 40368
99.9%
Devanagari 22
 
0.1%
None 2
 
< 0.1%
Emoticons 2
 
< 0.1%
Misc Symbols 1
 
< 0.1%
VS 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 4704
 
11.7%
m 3901
 
9.7%
i 2745
 
6.8%
o 2152
 
5.3%
. 2004
 
5.0%
l 2002
 
5.0%
g 1848
 
4.6%
c 1815
 
4.5%
@ 1609
 
4.0%
r 1390
 
3.4%
Other values (57) 16198
40.1%
Devanagari
ValueCountFrequency (%)
3
 
13.6%
3
 
13.6%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
1
 
4.5%
Other values (8) 8
36.4%
Misc Symbols
ValueCountFrequency (%)
1
100.0%
VS
ValueCountFrequency (%)
1
100.0%
None
ValueCountFrequency (%)
🤗 1
50.0%
🤣 1
50.0%
Emoticons
ValueCountFrequency (%)
😃 1
50.0%
😄 1
50.0%

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 403444
Missing (%) 99.7%
Memory size 3.1 MiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 393095
Missing (%) 97.2%
Memory size 3.1 MiB

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 364169
Missing (%) 90.0%
Memory size 3.1 MiB

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 357956
Missing (%) 88.5%
Memory size 3.1 MiB

Unnamed: 15
Categorical

HIGH CORRELATION  IMBALANCE  MISSING 

Distinct 19
Distinct (%) 0.2%
Missing 393841
Missing (%) 97.3%
Memory size 3.1 MiB
0.00
10018 
0.90
 
417
user
 
191
verified
 
38
 
36
Other values (14)
 
25

Length

Max length 33
Median length 4
Mean length 4.0129604
Min length 1
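
The mean length can be cross-checked against the other figures in this report. The sketch below assumes that mean length is the total character count divided by the number of non-missing values; the function name `mean_length` is hypothetical. The inputs are the totals reported for Unnamed: 15 and Unnamed: 16 and the dataset's 404566 observations.

```python
def mean_length(total_characters, n_observations, n_missing):
    """Average string length over non-missing values (assumed definition)."""
    non_missing = n_observations - n_missing
    return total_characters / non_missing


# Unnamed: 15 -> 43039 characters over 404566 - 393841 = 10725 values
mean_15 = mean_length(43039, 404566, 393841)
# Unnamed: 16 -> 3009 characters over 404566 - 403854 = 712 values
mean_16 = mean_length(3009, 404566, 403854)

print(round(mean_15, 7))  # 4.0129604
print(round(mean_16, 7))  # 4.2261236
```

Both results agree with the reported mean lengths, which supports the assumed definition.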

Characters and Unicode

Total characters 43039
Distinct characters 32
Distinct categories 4
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
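
As a rough illustration of how such per-category counts could be derived, the sketch below uses Python's standard `unicodedata` module. The function name `category_counts` and the input values are hypothetical and synthetic. The module returns two-letter category codes: `Nd` corresponds to Decimal Number, `Po` to Other Punctuation, and `Ll` to Lowercase Letter. Scripts and blocks are not exposed by the standard library, so only categories are shown.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# Synthetic values, not taken from the profiled dataset.
sample = ["0.00", "0.90", "user"]
counts = category_counts(sample)
print(counts["Nd"], counts["Po"], counts["Ll"])  # 6 2 4
```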

Unique

Unique 11
Unique (%) 0.1%

Sample

1st rowagarwalprince001@gmail.com
2nd row0.00
3rd row0.00
4th row0.00
5th row0.00

Common Values

ValueCountFrequency (%)
0.00 10018
 
2.5%
0.90 417
 
0.1%
user 191
 
< 0.1%
verified 38
 
< 0.1%
36
 
< 0.1%
0.31 6
 
< 0.1%
0.30 6
 
< 0.1%
0.33 2
 
< 0.1%
11.00 1
 
< 0.1%
100006108225903.00 1
 
< 0.1%
Other values (9) 9
 
< 0.1%
(Missing) 393841
97.3%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
0.00 10018
93.7%
0.90 417
 
3.9%
user 191
 
1.8%
verified 38
 
0.4%
0.31 6
 
0.1%
0.30 6
 
0.1%
0.33 2
 
< 0.1%
premium 1
 
< 0.1%
0.32 1
 
< 0.1%
97.00 1
 
< 0.1%
Other values (8) 8
 
0.1%

Most occurring characters

ValueCountFrequency (%)
0 30929
71.9%
. 10458
 
24.3%
9 419
 
1.0%
e 271
 
0.6%
r 236
 
0.5%
s 197
 
0.5%
u 193
 
0.4%
i 84
 
0.2%
v 40
 
0.1%
f 38
 
0.1%
Other values (22) 174
 
0.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 31390
72.9%
Other Punctuation 10461
 
24.3%
Lowercase Letter 1151
 
2.7%
Space Separator 37
 
0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 271
23.5%
r 236
20.5%
s 197
17.1%
u 193
16.8%
i 84
 
7.3%
v 40
 
3.5%
f 38
 
3.3%
d 38
 
3.3%
a 13
 
1.1%
m 9
 
0.8%
Other values (10) 32
 
2.8%
Decimal Number
ValueCountFrequency (%)
0 30929
98.5%
9 419
 
1.3%
3 18
 
0.1%
1 14
 
< 0.1%
2 6
 
< 0.1%
5 1
 
< 0.1%
8 1
 
< 0.1%
6 1
 
< 0.1%
7 1
 
< 0.1%
Other Punctuation
ValueCountFrequency (%)
. 10458
> 99.9%
@ 3
 
< 0.1%
Space Separator
ValueCountFrequency (%)
37
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 41888
97.3%
Latin 1151
 
2.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 271
23.5%
r 236
20.5%
s 197
17.1%
u 193
16.8%
i 84
 
7.3%
v 40
 
3.5%
f 38
 
3.3%
d 38
 
3.3%
a 13
 
1.1%
m 9
 
0.8%
Other values (10) 32
 
2.8%
Common
ValueCountFrequency (%)
0 30929
73.8%
. 10458
 
25.0%
9 419
 
1.0%
37
 
0.1%
3 18
 
< 0.1%
1 14
 
< 0.1%
2 6
 
< 0.1%
@ 3
 
< 0.1%
5 1
 
< 0.1%
8 1
 
< 0.1%
Other values (2) 2
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 43039
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 30929
71.9%
. 10458
 
24.3%
9 419
 
1.0%
e 271
 
0.6%
r 236
 
0.5%
s 197
 
0.5%
u 193
 
0.4%
i 84
 
0.2%
v 40
 
0.1%
f 38
 
0.1%
Other values (22) 174
 
0.4%

Unnamed: 16
Categorical

HIGH CORRELATION  IMBALANCE  MISSING 

Distinct 10
Distinct (%) 1.4%
Missing 403854
Missing (%) 99.8%
Memory size 3.1 MiB
0.00
432 
0.90
176 
user
63 
verified
 
33
0.31
 
3
Other values (5)
 
5

Length

Max length 21
Median length 4
Mean length 4.2261236
Min length 2

Characters and Unicode

Total characters 3009
Distinct characters 28
Distinct categories 5
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 5
Unique (%) 0.7%

Sample

1st rowdj.rohith41@gmail.com
2nd row0.00
3rd rowverified
4th row0.90
5th rowverified

Common Values

ValueCountFrequency (%)
0.00 432
 
0.1%
0.90 176
 
< 0.1%
user 63
 
< 0.1%
verified 33
 
< 0.1%
0.31 3
 
< 0.1%
dj.rohith41@gmail.com 1
 
< 0.1%
0.30 1
 
< 0.1%
Bihar in 1
 
< 0.1%
tz 1
 
< 0.1%
main tera hero 1
 
< 0.1%
(Missing) 403854
99.8%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0.00 432
60.4%
0.90 176
24.6%
user 63
 
8.8%
verified 33
 
4.6%
0.31 3
 
0.4%
dj.rohith41@gmail.com 1
 
0.1%
0.30 1
 
0.1%
bihar 1
 
0.1%
in 1
 
0.1%
tz 1
 
0.1%
Other values (3) 3
 
0.4%

Most occurring characters

ValueCountFrequency (%)
0 1653
54.9%
. 614
 
20.4%
9 176
 
5.8%
e 131
 
4.4%
r 100
 
3.3%
i 71
 
2.4%
s 63
 
2.1%
u 63
 
2.1%
d 34
 
1.1%
v 33
 
1.1%
Other values (18) 71
 
2.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 1838
61.1%
Other Punctuation 615
 
20.4%
Lowercase Letter 552
 
18.3%
Space Separator 3
 
0.1%
Uppercase Letter 1
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
e 131
23.7%
r 100
18.1%
i 71
12.9%
s 63
11.4%
u 63
11.4%
d 34
 
6.2%
v 33
 
6.0%
f 33
 
6.0%
h 4
 
0.7%
a 4
 
0.7%
Other values (9) 16
 
2.9%
Decimal Number
ValueCountFrequency (%)
0 1653
89.9%
9 176
 
9.6%
3 4
 
0.2%
1 4
 
0.2%
4 1
 
0.1%
Other Punctuation
ValueCountFrequency (%)
. 614
99.8%
@ 1
 
0.2%
Space Separator
ValueCountFrequency (%)
3
100.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 2456
81.6%
Latin 553
 
18.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
e 131
23.7%
r 100
18.1%
i 71
12.8%
s 63
11.4%
u 63
11.4%
d 34
 
6.1%
v 33
 
6.0%
f 33
 
6.0%
h 4
 
0.7%
a 4
 
0.7%
Other values (10) 17
 
3.1%
Common
ValueCountFrequency (%)
0 1653
67.3%
. 614
 
25.0%
9 176
 
7.2%
3 4
 
0.2%
1 4
 
0.2%
3
 
0.1%
@ 1
 
< 0.1%
4 1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3009
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 1653
54.9%
. 614
 
20.4%
9 176
 
5.8%
e 131
 
4.4%
r 100
 
3.3%
i 71
 
2.4%
s 63
 
2.1%
u 63
 
2.1%
d 34
 
1.1%
v 33
 
1.1%
Other values (18) 71
 
2.4%

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404277
Missing (%) 99.9%
Memory size 3.1 MiB

Unnamed: 18
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404453
Missing (%) > 99.9%
Memory size 3.1 MiB

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404539
Missing (%) > 99.9%
Memory size 3.1 MiB

Unnamed: 20
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404559
Missing (%) > 99.9%
Memory size 3.1 MiB

Unnamed: 21
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404560
Missing (%) > 99.9%
Memory size 3.1 MiB

Unnamed: 22
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404562
Missing (%) > 99.9%
Memory size 3.1 MiB

Unnamed: 23
Categorical

CONSTANT  MISSING 

Distinct 1
Distinct (%) 33.3%
Missing 404563
Missing (%) > 99.9%
Memory size 3.1 MiB
0.0

Length

Max length 3
Median length 3
Mean length 3
Min length 3

Characters and Unicode

Total characters 9
Distinct characters 2
Distinct categories 2
Distinct scripts 1
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 0
Unique (%) 0.0%
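
The Distinct and Unique figures for this variable can be reproduced with a short sketch. It assumes that "distinct" counts the different non-missing values, that "unique" counts values occurring exactly once, and that both percentages are taken over non-missing values. The function name `distinct_and_unique` is hypothetical. The input mirrors this column's three non-missing values of "0.0", with a single `None` standing in for the missing cells.

```python
from collections import Counter


def distinct_and_unique(values):
    """Summarise distinct and unique counts over non-missing values."""
    present = [v for v in values if v is not None]
    counts = Counter(present)
    n = len(present)
    n_distinct = len(counts)
    # "Unique" here means a value that appears exactly once (assumed definition).
    n_unique = sum(1 for c in counts.values() if c == 1)
    return {
        "distinct": n_distinct,
        "distinct_pct": 100 * n_distinct / n if n else 0.0,
        "unique": n_unique,
        "unique_pct": 100 * n_unique / n if n else 0.0,
    }


stats = distinct_and_unique(["0.0", "0.0", "0.0", None])
print(stats["distinct"], round(stats["distinct_pct"], 1))  # 1 33.3
print(stats["unique"], round(stats["unique_pct"], 1))      # 0 0.0
```

Under these assumptions the output matches the reported Distinct 1 (33.3%) and Unique 0 (0.0%).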

Sample

1st row0.0
2nd row0.0
3rd row0.0

Common Values

ValueCountFrequency (%)
0.0 3
 
< 0.1%
(Missing) 404563
> 99.9%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0.0 3
100.0%

Most occurring characters

ValueCountFrequency (%)
0 6
66.7%
. 3
33.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6
66.7%
Other Punctuation 3
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 6
100.0%
Other Punctuation
ValueCountFrequency (%)
. 3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 9
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 6
66.7%
. 3
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 6
66.7%
. 3
33.3%

Unnamed: 24
Text

MISSING 

Distinct 2
Distinct (%) 100.0%
Missing 404564
Missing (%) > 99.9%
Memory size 3.1 MiB

Length

Max length 4
Median length 3
Mean length 3
Min length 2

Characters and Unicode

Total characters 6
Distinct characters 6
Distinct categories 3
Distinct scripts 2
Distinct blocks 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 2
Unique (%) 100.0%

Sample

1st rowuser
2nd row
ValueCountFrequency (%)
user 1
50.0%
1
50.0%

Most occurring characters

ValueCountFrequency (%)
u 1
16.7%
s 1
16.7%
e 1
16.7%
r 1
16.7%
1
16.7%
1
16.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4
66.7%
Currency Symbol 1
 
16.7%
Space Separator 1
 
16.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
u 1
25.0%
s 1
25.0%
e 1
25.0%
r 1
25.0%
Currency Symbol
ValueCountFrequency (%)
1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 4
66.7%
Common 2
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
u 1
25.0%
s 1
25.0%
e 1
25.0%
r 1
25.0%
Common
ValueCountFrequency (%)
1
50.0%
1
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5
83.3%
Currency Symbols 1
 
16.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
u 1
20.0%
s 1
20.0%
e 1
20.0%
r 1
20.0%
1
20.0%
Currency Symbols
ValueCountFrequency (%)
1
100.0%

Unnamed: 25
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404563
Missing (%) > 99.9%
Memory size 3.1 MiB

Unnamed: 26
Categorical

CONSTANT  MISSING 

Distinct 1
Distinct (%) 100.0%
Missing 404565
Missing (%) > 99.9%
Memory size 3.1 MiB
0.0

Length

Max length 3
Median length 3
Mean length 3
Min length 3

Characters and Unicode

Total characters 3
Distinct characters 2
Distinct categories 2
Distinct scripts 1
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1
Unique (%) 100.0%

Sample

1st row0.0

Common Values

ValueCountFrequency (%)
0.0 1
 
< 0.1%
(Missing) 404565
> 99.9%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0.0 1
100.0%

Most occurring characters

ValueCountFrequency (%)
0 2
66.7%
. 1
33.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2
66.7%
Other Punctuation 1
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2
66.7%
. 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2
66.7%
. 1
33.3%

Unnamed: 27
Categorical

CONSTANT  MISSING 

Distinct 1
Distinct (%) 100.0%
Missing 404565
Missing (%) > 99.9%
Memory size 3.1 MiB
4.0

Length

Max length 3
Median length 3
Mean length 3
Min length 3

Characters and Unicode

Total characters 3
Distinct characters 3
Distinct categories 2
Distinct scripts 1
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1
Unique (%) 100.0%

Sample

1st row4.0

Common Values

ValueCountFrequency (%)
4.0 1
 
< 0.1%
(Missing) 404565
> 99.9%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
4.0 1
100.0%

Most occurring characters

ValueCountFrequency (%)
4 1
33.3%
. 1
33.3%
0 1
33.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2
66.7%
Other Punctuation 1
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
4 1
50.0%
0 1
50.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
4 1
33.3%
. 1
33.3%
0 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
4 1
33.3%
. 1
33.3%
0 1
33.3%

Unnamed: 28
Text

MISSING 

Distinct 2
Distinct (%) 100.0%
Missing 404564
Missing (%) > 99.9%
Memory size 3.1 MiB

Length

Max length 6
Median length 4.5
Mean length 4.5
Min length 3

Characters and Unicode

Total characters 9
Distinct characters 9
Distinct categories 5
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 2
Unique (%) 100.0%

Sample

1st rowBihar
2nd row40*
ValueCountFrequency (%)
bihar 1
50.0%
40 1
50.0%

Most occurring characters

ValueCountFrequency (%)
B 1
11.1%
i 1
11.1%
h 1
11.1%
a 1
11.1%
r 1
11.1%
1
11.1%
4 1
11.1%
0 1
11.1%
* 1
11.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4
44.4%
Decimal Number 2
22.2%
Uppercase Letter 1
 
11.1%
Space Separator 1
 
11.1%
Other Punctuation 1
 
11.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 1
25.0%
h 1
25.0%
a 1
25.0%
r 1
25.0%
Decimal Number
ValueCountFrequency (%)
4 1
50.0%
0 1
50.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%
Other Punctuation
ValueCountFrequency (%)
* 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5
55.6%
Common 4
44.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
B 1
20.0%
i 1
20.0%
h 1
20.0%
a 1
20.0%
r 1
20.0%
Common
ValueCountFrequency (%)
1
25.0%
4 1
25.0%
0 1
25.0%
* 1
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 9
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
B 1
11.1%
i 1
11.1%
h 1
11.1%
a 1
11.1%
r 1
11.1%
1
11.1%
4 1
11.1%
0 1
11.1%
* 1
11.1%

Unnamed: 29
Text

CONSTANT  MISSING 

Distinct 1
Distinct (%) 100.0%
Missing 404565
Missing (%) > 99.9%
Memory size 3.1 MiB

Length

Max length 3
Median length 3
Mean length 3
Min length 3

Characters and Unicode

Total characters 3
Distinct characters 3
Distinct categories 2
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1
Unique (%) 100.0%

Sample

1st rowav4
ValueCountFrequency (%)
av4 1
100.0%

Most occurring characters

ValueCountFrequency (%)
a 1
33.3%
v 1
33.3%
4 1
33.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 2
66.7%
Decimal Number 1
33.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
a 1
50.0%
v 1
50.0%
Decimal Number
ValueCountFrequency (%)
4 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 2
66.7%
Common 1
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
a 1
50.0%
v 1
50.0%
Common
ValueCountFrequency (%)
4 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
a 1
33.3%
v 1
33.3%
4 1
33.3%

Unnamed: 30
Categorical

CONSTANT  MISSING 

Distinct 1
Distinct (%) 100.0%
Missing 404565
Missing (%) > 99.9%
Memory size 3.1 MiB
2.0

Length

Max length 3
Median length 3
Mean length 3
Min length 3

Characters and Unicode

Total characters 3
Distinct characters 3
Distinct categories 2
Distinct scripts 1
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1
Unique (%) 100.0%

Sample

1st row2.0

Common Values

ValueCountFrequency (%)
2.0 1
 
< 0.1%
(Missing) 404565
> 99.9%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
2.0 1
100.0%

Most occurring characters

ValueCountFrequency (%)
2 1
33.3%
. 1
33.3%
0 1
33.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2
66.7%
Other Punctuation 1
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
2 1
50.0%
0 1
50.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
2 1
33.3%
. 1
33.3%
0 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 3
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
2 1
33.3%
. 1
33.3%
0 1
33.3%

Unnamed: 31
Text

CONSTANT  MISSING 

Distinct 1
Distinct (%) 100.0%
Missing 404565
Missing (%) > 99.9%
Memory size 3.1 MiB

Length

Max length 4
Median length 4
Mean length 4
Min length 4

Characters and Unicode

Total characters 4
Distinct characters 4
Distinct categories 3
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1
Unique (%) 100.0%

Sample

1st row3o 5
ValueCountFrequency (%)
3o 1
50.0%
5 1
50.0%

Most occurring characters

ValueCountFrequency (%)
3 1
25.0%
o 1
25.0%
1
25.0%
5 1
25.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 2
50.0%
Lowercase Letter 1
25.0%
Space Separator 1
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
3 1
50.0%
5 1
50.0%
Lowercase Letter
ValueCountFrequency (%)
o 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 3
75.0%
Latin 1
 
25.0%

Most frequent character per script

Common
ValueCountFrequency (%)
3 1
33.3%
1
33.3%
5 1
33.3%
Latin
ValueCountFrequency (%)
o 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 4
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3 1
25.0%
o 1
25.0%
1
25.0%
5 1
25.0%

Unnamed: 32
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404566
Missing (%) 100.0%
Memory size 3.1 MiB

Unnamed: 33
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404566
Missing (%) 100.0%
Memory size 3.1 MiB

Unnamed: 34
Text

CONSTANT  MISSING 

Distinct 1
Distinct (%) 100.0%
Missing 404565
Missing (%) > 99.9%
Memory size 3.1 MiB

Length

Max length 6
Median length 6
Mean length 6
Min length 6

Characters and Unicode

Total characters 6
Distinct characters 6
Distinct categories 3
Distinct scripts 2
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 1
Unique (%) 100.0%

Sample

1st rowBihar
ValueCountFrequency (%)
bihar 1
100.0%

Most occurring characters

ValueCountFrequency (%)
B 1
16.7%
i 1
16.7%
h 1
16.7%
a 1
16.7%
r 1
16.7%
1
16.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 4
66.7%
Uppercase Letter 1
 
16.7%
Space Separator 1
 
16.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i 1
25.0%
h 1
25.0%
a 1
25.0%
r 1
25.0%
Uppercase Letter
ValueCountFrequency (%)
B 1
100.0%
Space Separator
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 5
83.3%
Common 1
 
16.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
B 1
20.0%
i 1
20.0%
h 1
20.0%
a 1
20.0%
r 1
20.0%
Common
ValueCountFrequency (%)
1
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
B 1
16.7%
i 1
16.7%
h 1
16.7%
a 1
16.7%
r 1
16.7%
1
16.7%

Unnamed: 35
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404566
Missing (%) 100.0%
Memory size 3.1 MiB

Unnamed: 36
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing 404566
Missing (%) 100.0%
Memory size 3.1 MiB

Unnamed: 37
Categorical

CONSTANT  MISSING 

Distinct 1
Distinct (%) 50.0%
Missing 404564
Missing (%) > 99.9%
Memory size 3.1 MiB
0.31

Length

Max length 4
Median length 4
Mean length 4
Min length 4

Characters and Unicode

Total characters 8
Distinct characters 4
Distinct categories 2
Distinct scripts 1
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 0
Unique (%) 0.0%

Sample

1st row0.31
2nd row0.31

Common Values

ValueCountFrequency (%)
0.31 2
 
< 0.1%
(Missing) 404564
> 99.9%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0.31 2
100.0%

Most occurring characters

ValueCountFrequency (%)
0 2
25.0%
. 2
25.0%
3 2
25.0%
1 2
25.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 6
75.0%
Other Punctuation 2
 
25.0%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 2
33.3%
3 2
33.3%
1 2
33.3%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 8
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 2
25.0%
. 2
25.0%
3 2
25.0%
1 2
25.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 8
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 2
25.0%
. 2
25.0%
3 2
25.0%
1 2
25.0%

Unnamed: 38
Categorical

CONSTANT  MISSING 

Distinct 1
Distinct (%) 50.0%
Missing 404564
Missing (%) > 99.9%
Memory size 3.1 MiB
0.0

Length

Max length 3
Median length 3
Mean length 3
Min length 3

Characters and Unicode

Total characters 6
Distinct characters 2
Distinct categories 2
Distinct scripts 1
Distinct blocks 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique 0
Unique (%) 0.0%

Sample

1st row0.0
2nd row0.0

Common Values

ValueCountFrequency (%)
0.0 2
 
< 0.1%
(Missing) 404564
> 99.9%

Length

Histogram of lengths of the category

Common Values (Plot)

ValueCountFrequency (%)
0.0 2
100.0%

Most occurring characters

ValueCountFrequency (%)
0 4
66.7%
. 2
33.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 4
66.7%
Other Punctuation 2
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 4
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 6
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 4
66.7%
. 2
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 6
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 4
66.7%
. 2
33.3%

Correlations

             Carrier  Unnamed: 15  Unnamed: 16
Carrier        1.000        0.000        0.000
Unnamed: 15    0.000        1.000        0.825
Unnamed: 16    0.000        0.825        1.000
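
The report does not state which association measure produced these values. One common choice for pairs of categorical variables is Cramér's V, and the sketch below shows its plain (uncorrected) form purely as an illustration under that assumption. The function name `cramers_v` and the input lists are hypothetical and synthetic; a profiling tool may apply a bias correction that this sketch omits.

```python
import math
from collections import Counter


def cramers_v(xs, ys):
    """Uncorrected Cramér's V for two categorical sequences, ignoring missing pairs."""
    pairs = [(x, y) for x, y in zip(xs, ys) if x is not None and y is not None]
    n = len(pairs)
    row_totals = Counter(x for x, _ in pairs)
    col_totals = Counter(y for _, y in pairs)
    joint = Counter(pairs)
    k = min(len(row_totals), len(col_totals)) - 1
    if n == 0 or k <= 0:
        return 0.0  # undefined when a variable has fewer than two levels
    chi2 = 0.0
    for x, rx in row_totals.items():
        for y, cy in col_totals.items():
            expected = rx * cy / n
            observed = joint.get((x, y), 0)
            chi2 += (observed - expected) ** 2 / expected
    return math.sqrt(chi2 / (n * k))


# Synthetic data: perfectly associated, then independent.
print(cramers_v(["a", "a", "b", "b"], ["u", "u", "v", "v"]))  # 1.0
print(cramers_v(["a", "a", "b", "b"], ["u", "v", "u", "v"]))  # 0.0
```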

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
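
A minimal sketch of nullity correlation, assuming it is the Pearson correlation between per-column presence indicators (1 for present, 0 for missing). The function name `nullity_correlation` and the columns are hypothetical, and the data is synthetic.

```python
import math


def nullity_correlation(col_a, col_b):
    """Pearson correlation of presence indicators for two equal-length columns."""
    a = [0 if v is None else 1 for v in col_a]
    b = [0 if v is None else 1 for v in col_b]
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return float("nan")  # a fully present or fully missing column has no variance
    return cov / math.sqrt(var_a * var_b)


# Synthetic columns: the first two are missing together, the third is the opposite.
col_x = ["v1", None, "v2", None]
col_y = ["w1", None, "w2", None]
col_z = [None, "z1", None, "z2"]

print(nullity_correlation(col_x, col_y))  # 1.0
print(nullity_correlation(col_x, col_z))  # -1.0
```

A value near 1 means two columns tend to be filled in (or missing) on the same rows; a value near -1 means one tends to be present when the other is missing.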

Sample

NumberCarrierNameGenderAddressJobTitleCompanyNameEmailFacebookTwitterUnnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21Unnamed: 22Unnamed: 23Unnamed: 24Unnamed: 25Unnamed: 26Unnamed: 27Unnamed: 28Unnamed: 29Unnamed: 30Unnamed: 31Unnamed: 32Unnamed: 33Unnamed: 34Unnamed: 35Unnamed: 36Unnamed: 37Unnamed: 38
0919060000019.0TelenorP N. RNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
1919060000064.0TelenorRajat AroraNaNBihar inNaNNaNrajata860@gmail.comNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
2919060001071.0TelenorAnil Giri Dalia SpeciliestNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
3919060001331.0TelenorZameerNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
4919060001497.0TelenorNaveen NaveenNaNBihar inNaNNaNnaveenkuta3101@gmail.comNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
5919060001485.0TelenorJob GovindaNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
6919060001479.0TelenorParashuram ShobhaNaNBihar inNaNNaNparshuramk1970@gmail.comNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
7919060001473.0TelenorPp PNaNBihar inNaNNaNkss@gmail.comNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
8919060001472.0TelenorMd SameerNaNBihar inNaNNaNmdsameershaikh75@gmail.comNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
9919060001469.0TelenorHamid HamiNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
NumberCarrierNameGenderAddressJobTitleCompanyNameEmailFacebookTwitterUnnamed: 10Unnamed: 11Unnamed: 12Unnamed: 13Unnamed: 14Unnamed: 15Unnamed: 16Unnamed: 17Unnamed: 18Unnamed: 19Unnamed: 20Unnamed: 21Unnamed: 22Unnamed: 23Unnamed: 24Unnamed: 25Unnamed: 26Unnamed: 27Unnamed: 28Unnamed: 29Unnamed: 30Unnamed: 31Unnamed: 32Unnamed: 33Unnamed: 34Unnamed: 35Unnamed: 36Unnamed: 37Unnamed: 38
404556917646099235.0TelenorSonu KumarNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
404557917646099230.0TelenorDNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
404558917646099224.0TelenorRaja RamNaNNainpurMadhya PradeshNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
404559917646099192.0TelenorNaNNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
404560917646099177.0TelenorNaNNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
404561917646099115.0TelenorDeepak KumarNaNNainpurMadhya Pradesh inNaNNaNdipaksanjayjee@gmail.comNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
404562917646099095.0TelenorSonuNaNNainpurMadhya PradeshNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
404563917646099087.0TelenorKhalid RazaNaNBihar inNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
404564917646099045.0TelenorNaNNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN
404565917646099016.0TelenorNidhi What's AapNaNNainpurMadhya PradeshNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN

Duplicate rows

Most frequently occurring

CarrierNameGenderAddressJobTitleCompanyNameEmailFacebookTwitterUnnamed: 10Unnamed: 15Unnamed: 16Unnamed: 23Unnamed: 24Unnamed: 26Unnamed: 27Unnamed: 28Unnamed: 29Unnamed: 30Unnamed: 31Unnamed: 34Unnamed: 37Unnamed: 38# duplicates
16818TelenorNaNNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN21101
16847TelenorNaNNaNUttar Pradesh EastNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN11129
16849TelenorNaNNaNUttar Pradesh WestNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN8569
16824TelenorNaNNaNBundiRajasthanNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN2201
16819TelenorNaNNaNBihar inNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN1988
16835TelenorNaNNaNGoharganjMadhya PradeshNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN1276
16826TelenorNaNNaNChhabraRajasthanNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN766
16814TelenorNaNNaNBaranRajasthanNaNNaNNaNNaNNaN0.00NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN598
10580TelenorRahulNaNBiharNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN484
16832TelenorNaNNaNGairatganjMadhya PradeshNaNNaNNaNNaNNaN0.00NaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaNNaN427
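
The duplicates table above omits the Number column, so rows appear to be compared on the remaining listed columns only. A rough sketch of that kind of count follows, using synthetic rows and a hypothetical function name `duplicate_rows`; it is an illustration and not the profiling tool's own routine.

```python
from collections import Counter


def duplicate_rows(rows):
    """Return (row, count) pairs for rows occurring more than once, most frequent first."""
    counts = Counter(tuple(row) for row in rows)
    dupes = [(row, c) for row, c in counts.items() if c > 1]
    return sorted(dupes, key=lambda item: item[1], reverse=True)


# Synthetic rows of (carrier, name, region); None marks a missing cell.
rows = [
    ("CarrierA", None, "Region1"),
    ("CarrierA", None, "Region1"),
    ("CarrierA", None, "Region1"),
    ("CarrierA", None, "Region2"),
    ("CarrierA", None, "Region2"),
    ("CarrierA", "Example Name", "Region1"),
]

for row, count in duplicate_rows(rows):
    print(row, count)
```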